DNA ENCODING ACYLCOENZYME A: CHOLESTEROL
ACYLTRANSFERASE AND USES THEREOF

This application is a Continuation-In-Part of U.S. Serial
No. 08/657,620, filed May 30, 1996, the content of which
is incorporated by reference into this application.

Throughout this application, various publications are
referenced by Arabic numerals. Full citations for these
publications may be found listed at the end of the
specification. The disclosures of these publications in
their entireties are hereby incorporated by reference
into this application in order to more fully describe the
state of the art as known to those skilled therein.

Background of the Invention 

Cholesterol or related sterols, required for the
viability of eukaryotic cells, exist in the free form or
as esters conjugated to fatty acids. The concentration
of free sterol determines the fluidity of eukaryotic cell
membranes, whereas esterified sterols cannot participate
in membrane assembly. The esterification of
intracellular sterol, mediated in mammals by the
membrane-bound enzyme acylcoenzyme A: cholesterol
acyltransferase, is thus a critical homeostatic
determinant of membrane function (1, 2). For example,
cholesterol depletion of the rough endoplasmic reticulum
(ER) relative to the smooth ER (3) may modulate protein
translocation or membrane-associated transcriptional
activators such as the Sterol Response Element Binding
proteins (SREBP, 4). In addition, production of
cholesterol ester (CE) by acylcoenzyme A: cholesterol
acyltransferase in the rough ER may influence the
transport of sterol between intracellular pools. Similar
esterification activities have been observed in other
eukaryotes such as plants and yeasts (5).

Elevations in acylcoenzyme A: cholesterol acyltransferase
activity perturb several pathways that contribute to
hyperlipidemia and atherosclerosis. Sterol
esterification modifies the activity of the low density
lipoprotein (LDL) receptor and alters serum lipoprotein
composition to be pro-atherogenic (6, 7). It may also be
a rate limiting step in intestinal sterol absorption (8).
Furthermore, CE deposition in the arterial wall is an
important initial step in atherogenesis (9). The
understanding of the acylcoenzyme A: cholesterol
acyltransferase reaction has been hampered by the
difficulty of biochemical purification and by a poor
grasp of the relevant genetic determinants. A human
acylcoenzyme A: cholesterol acyltransferase I gene from
macrophages was identified by complementation of Chinese
Hamster Ovary cell lines deficient in acylcoenzyme A:
cholesterol acyltransferase activity (10) and was
functionally expressed in insect cells devoid of
endogenous activity (11).



Summary of the Invention

This invention provides an isolated nucleic acid which
encodes an acylcoenzyme A: cholesterol acyltransferase II
or an acylcoenzyme A: cholesterol acyltransferase III.

This invention also provides a vector which includes the
isolated nucleic acid which encodes an acylcoenzyme A:
cholesterol acyltransferase II or an acylcoenzyme A:
cholesterol acyltransferase III and a host vector system
which includes the vector.

This invention also provides a method of producing a
polypeptide which comprises growing such a host vector
system under suitable conditions permitting
production of the polypeptide and recovering the
polypeptide so produced. This invention also provides a
purified wildtype acylcoenzyme A: cholesterol
acyltransferase II or an acylcoenzyme A: cholesterol
acyltransferase III.

This invention also provides an oligonucleotide of at
least 15 nucleotides capable of specifically hybridizing
with a unique sequence of nucleotides present within a
nucleic acid which encodes a wildtype acylcoenzyme A:
cholesterol acyltransferase II or an acylcoenzyme A:
cholesterol acyltransferase III without hybridizing to a
nucleic acid which encodes a mutant acylcoenzyme A:
cholesterol acyltransferase II or an acylcoenzyme A:
cholesterol acyltransferase III. This invention also
provides an oligonucleotide of at least 15 nucleotides
capable of specifically hybridizing with a unique
sequence of nucleotides present within the nucleic acid
which encodes a mutant acylcoenzyme A: cholesterol
acyltransferase II or an acylcoenzyme A: cholesterol
acyltransferase III without hybridizing to a nucleic acid
which encodes a wildtype acylcoenzyme A: cholesterol
acyltransferase II or an acylcoenzyme A: cholesterol
acyltransferase III.

This invention also provides a method for determining
whether a subject known to have an imbalance in sterol
levels has the imbalance due to a defect in
esterification of sterol and for treating a subject who
has an imbalance in sterol levels due to a defect in
esterification of sterol.



This invention also provides methods for inhibiting
wildtype acylcoenzyme A: cholesterol acyltransferase II
or an acylcoenzyme A: cholesterol acyltransferase III in
a subject.

This invention also provides a method for identifying a
chemical compound which is capable of inhibiting
acylcoenzyme A: cholesterol acyltransferase II or an
acylcoenzyme A: cholesterol acyltransferase III in a
subject and a pharmaceutical composition comprising
the chemical compound so identified.



This invention also provides a transgenic, nonhuman
mammal comprising the isolated nucleic acid which encodes
acylcoenzyme A: cholesterol acyltransferase II or an
acylcoenzyme A: cholesterol acyltransferase III.



Brief Description of the Figures

Abbreviations: The amino acid residues are abbreviated
as follows: A, Ala; C, Cys; D, Asp; E, Glu; F, Phe; G,
Gly; H, His; I, Ile; K, Lys; L, Leu; M, Met; N, Asn; P,
Pro; Q, Gln; R, Arg; S, Ser; T, Thr; V, Val; W, Trp; and
Y, Tyr. CON: consensus sequence.

Figures 1A and 1B. Protein sequence alignments predicted
from candidate genes for the human acylcoenzyme A:
cholesterol acyltransferase gene I, the yeast homologs,
acylcoenzyme A: cholesterol acyltransferase-related
enzyme I and acylcoenzyme A: cholesterol
acyltransferase-related enzyme II, and a consensus
sequence of all three sequences.

Identical residues between all the sequences
are in bold face. Residues of the candidate
leucine zipper heptad motif are italicized.
Potential transmembrane domains were identified
at residues 132 to 155 and 460 to 483; 186 to
202 and 406 to 421; and 215 to 231 and 439 to
451, for human acylcoenzyme A: cholesterol
acyltransferase (Sequence I.D. No.: 2),
acylcoenzyme A: cholesterol acyltransferase-related
enzyme I (Sequence I.D. No.: 4) and
acylcoenzyme A: cholesterol acyltransferase-related
enzyme II (Sequence I.D. No.: 6),
respectively. The firefly luciferase signature
sequences identified in human acylcoenzyme A:
cholesterol acyltransferase I (10) were not
conserved in the yeast genes. CON (Sequence
I.D. No.: 13) denotes the consensus sequence
between the sequences of human acylcoenzyme A:
cholesterol acyltransferase, acylcoenzyme A:
cholesterol acyltransferase-related enzyme I
and acylcoenzyme A: cholesterol
acyltransferase-related enzyme II. R07932
denotes the partial sequence of another human
acylcoenzyme A: cholesterol acyltransferase
candidate cDNA (residues 500 to 600) (Sequence
I.D. No.: 14). The asterisks indicate the
residues in R07932 identical to those of the
other sequences.

1A. Alignment of amino acid residues 1-362 of
acylcoenzyme A: cholesterol acyltransferase-related
enzyme I and the identical residues in
acylcoenzyme A: cholesterol acyltransferase-related
enzyme II, human acylcoenzyme A:
cholesterol acyltransferase and CON.

1B. Alignment of amino acid residues 363-611 of
acylcoenzyme A: cholesterol acyltransferase-related
enzyme I and the identical residues in
acylcoenzyme A: cholesterol acyltransferase-related
enzyme II, human acylcoenzyme A:
cholesterol acyltransferase and CON.



Figures 2A, 2B, 2C, 2D and 2E. Construction and analysis
of acylcoenzyme A: cholesterol acyltransferase genes and
deletion mutants.

2A. The are1ΔNA deletion. The schematic depicts a
fragment from yeast chromosome III in plasmid
pH3(34). Strategic restriction endonucleases
are indicated (H, Hind III; B, Bam HI).

2B. The autoradiogram depicts Bam HI digested DNA
from wild-type or disrupted diploid strains
probed with the 2993-bp Bam HI fragment. This
produced a fragment corresponding to the
wild-type acylcoenzyme A: cholesterol
acyltransferase-related enzyme I locus and a
1984-bp fragment characterizing the are1ΔNA
allele. The diploid is heterozygous for the
acylcoenzyme A: cholesterol acyltransferase-related
enzyme I deletion.

2C. Reduced stringency hybridization of yeast
genomic DNA with acylcoenzyme A: cholesterol
acyltransferase-related enzyme I coding
sequences. Genomic DNA from wild-type or
ARE1/are1ΔNA diploids was reprobed with an
Nhe I-Avr II fragment corresponding to the
acylcoenzyme A: cholesterol acyltransferase-related
enzyme I open reading frame ("ORF").
Hybridizations and washes were performed at
60 °C in the absence of formamide.

2D. The are2Δ deletion. In step 1, PCR amplifying
oligonucleotides KO-5' and KO-3' and a LEU2
template were used to produce the selectable
yeast gene flanked at the 5' and 3' ends by
acylcoenzyme A: cholesterol acyltransferase-related
enzyme II. In step 2, this was used to
direct homologous recombination at acylcoenzyme
A: cholesterol acyltransferase-related enzyme
II by transformation of a diploid strain and
selection for leucine prototrophy. In step 3,
integrants at acylcoenzyme A: cholesterol
acyltransferase-related enzyme II were
identified by a PCR reaction using
oligonucleotides flanking ARE2 (are2-5' and
are2-3') and a 3' amplimer within LEU2 (L2-3').

2E. A 999-bp fragment identifies are2Δ, as shown in
the ethidium bromide stained agarose gel. The
wild-type fragment (2206-bp) is also produced
in the same reaction. Leucine prototrophic
transformants with deletions of acylcoenzyme A:
cholesterol acyltransferase-related enzyme II
were obtained at a frequency of ~2%. M
indicates the 50-2,000-bp ladder markers
(Bio-Rad Laboratories).

Figures 3A and 3B. Fluorescent staining of triglyceride
and sterol ester.

The cells were grown in YEPD to stationary
phase, washed with deionized H2O, and incubated
with 1 µg/ml Nile Red (1 mg/ml in acetone).
Fluorescent images were obtained with a BioRad
MRC600 laser scanning confocal microscope
(BioRad Microscience, Hercules, CA) on an
inverted Zeiss Axiovert microscope (Zeiss,
Oberkochen, Germany) using a 63X (NA 1.4) Zeiss
Plan-apo infinity corrected objective. Samples
were illuminated with the 488 nm line from an
argon ion laser and the fluorescence was
visualized with a 540 nm dichroic mirror and
550 nm long-pass emission filter. Staining of
the cytoplasmic lipid droplets was sensitive to
treatment with isopropanol, proving them to be
lipid in nature.

3A. Wild-type cells.

3B. are1ΔNA are2Δ double mutant cells.

Figures 4A, 4B, 4C and 4D. Neutral lipid and sterol
biosynthesis in ARE deletion mutants.

Strain genotypes are as described in the text;
dpm/mg dry weight: disintegrations per minute
per milligram of dry weight of cells.

4A. Triglyceride biosynthesis. Total lipids were
extracted from cells grown in media containing
3H-oleate and analyzed by thin-layer
chromatography.

4B. Sterol ester biosynthesis. Total lipids were
extracted from cells grown in media containing
3H-oleate and analyzed by thin-layer
chromatography.

4C. Sterol ester biosynthesis in wild-type and
mutant cells transformed with vector control
(black box) or acylcoenzyme A: cholesterol
acyltransferase-related enzyme I
over-expression plasmids, YEp3-16 (increased
copy number, shaded box) and pADH5-36
(transcription from the ADH promoter, open
boxes). Cells were grown in selective media to
maintain the acylcoenzyme A: cholesterol
acyltransferase-related enzyme I expression
plasmids. Lipids were labeled, extracted and
analyzed as above.

4D. Sterol biosynthesis in acylcoenzyme A:
cholesterol acyltransferase-related enzyme
deletion mutants. Lipids were labeled in
synthetic complete media containing [1-14C]
acetate, saponified and extracted with hexane
and subjected to thin layer chromatography
analysis. The data is representative of three
separate experiments and expressed as the ratio
of incorporation into sterols to incorporation
into fatty acids.

Figures 5A, 5B, 5C, 5D, 5E and 5F. The nucleic acid and
amino acid or predicted amino acid sequences.

5A-1 - 5A-3.
The nucleic acid sequence of human acylcoenzyme
A: cholesterol acyltransferase I designated
Sequence ID No.: 1. The amino acid sequence of
human acylcoenzyme A: cholesterol
acyltransferase I designated Sequence ID No.: 2.

5A-1. Nucleic acid sequence of human
acylcoenzyme A: cholesterol
acyltransferase I from nucleic acid
bases 1-1624. Amino acid sequence of
human acylcoenzyme A: cholesterol
acyltransferase I from amino acid
residues 1-76.

5A-2. Nucleic acid sequence of human
acylcoenzyme A: cholesterol
acyltransferase I from nucleic acid
bases 1625-2524. Amino acid sequence
of human acylcoenzyme A: cholesterol
acyltransferase I from amino acid
residues 77-376.

5A-3. Nucleic acid sequence of human
acylcoenzyme A: cholesterol
acyltransferase I from nucleic acid
bases 2525-3649. Amino acid sequence
of human acylcoenzyme A: cholesterol
acyltransferase I from amino acid
residues 377-551.

5B-1 - 5B-3.
The nucleic acid sequence of yeast acylcoenzyme
A: cholesterol acyltransferase-related enzyme
I designated Sequence ID No.: 3. The amino
acid sequence of yeast acylcoenzyme A:
cholesterol acyltransferase-related enzyme I
designated Sequence ID No.: 4.

5B-1. Nucleic acid sequence of acylcoenzyme
A: cholesterol acyltransferase-related
enzyme I from nucleic acid
bases 1-1289. Amino acid sequence of
acylcoenzyme A: cholesterol
acyltransferase-related enzyme I from
amino acid residues 1-209.

5B-2. Nucleic acid sequence of acylcoenzyme
A: cholesterol acyltransferase-related
enzyme I from nucleic acid
bases 1290-2114. Amino acid sequence
of acylcoenzyme A: cholesterol
acyltransferase-related enzyme I from
amino acid residues 210-484.

5B-3. Nucleic acid sequence of acylcoenzyme
A: cholesterol acyltransferase-related
enzyme I from nucleic acid
bases 2115-2601. Amino acid sequence
of acylcoenzyme A: cholesterol
acyltransferase-related enzyme I from
amino acid residues 485-611.

5C-1 - 5C-3.
The nucleic acid sequence of yeast acylcoenzyme
A: cholesterol acyltransferase-related enzyme
II designated Sequence ID No.: 5. The amino
acid sequence of yeast acylcoenzyme A:
cholesterol acyltransferase-related enzyme II
designated Sequence ID No.: 6.

5C-1. Nucleic acid sequence of acylcoenzyme
A: cholesterol acyltransferase-related
enzyme II from nucleic acid
bases 1-1061. Amino acid sequence of
acylcoenzyme A: cholesterol
acyltransferase-related enzyme II
from amino acid residues 1-238.

5C-2. Nucleic acid sequence of acylcoenzyme
A: cholesterol acyltransferase-related
enzyme II from nucleic acid
bases 1062-1961. Amino acid sequence
of acylcoenzyme A: cholesterol
acyltransferase-related enzyme II
from amino acid residues 239-538.

5C-3. Nucleic acid sequence of acylcoenzyme
A: cholesterol acyltransferase-related
enzyme II from nucleic acid
bases 1962-2421. Amino acid sequence
of acylcoenzyme A: cholesterol
acyltransferase-related enzyme II
from amino acid residues 539-643.

5D. The nucleic acid sequence of mouse acylcoenzyme
A: cholesterol acyltransferase II designated
Sequence ID No.: 11. The amino acid sequence
of mouse acylcoenzyme A: cholesterol
acyltransferase II designated Sequence ID No.: 12.

Figure 6A. A restriction map of the expression vector
YepAB-ACAT2.

Figures 6B and 6C. Expression of human macrophage ACAT
in pRS426GP.

6B. The ACAT open reading frame was inserted
at the NotI and SacI sites, downstream of
the promoter of the GAL1/10 gene
(GAL1/10p) as described in the text to
produce pRS426-ACAT. URA3 and Ampr denote
selectable markers for yeast and E. coli
respectively. The yeast and bacterial
origins of replication (2µm and ori,
respectively) are indicated.

6C. Immunoblot of human ACAT in protein
extracts from cells transformed with
pRS426-ACAT. Double mutant cells (are1-
are2-), transformed with pRS426-ACAT
(hACAT) or with pRS426GP (vector), were
induced by growth in galactose. Proteins
were analyzed by immunoblotting.
Equivalent amounts of protein extracts
from mouse adrenal cells were loaded for
comparison. Molecular weight reference
markers (BioRad) are indicated (M). The
arrow indicates the position of the DM10
immunoreactive product in extracts from
murine adrenals. The expressed form of
hACAT in yeast is of coincident mobility.
Figures 7A and 7B. Multiple human tissue Northern
analysis of poly(A)+ RNAs probed with 32P-labeled cDNA
CI.

7A. Tissue specific expression of wildtype human
acylcoenzyme A: cholesterol acyltransferase II
using a wildtype acylcoenzyme A: cholesterol
acyltransferase II specific probe.

7B. Tissue specific expression of wildtype human
acylcoenzyme A: cholesterol acyltransferase I
using a wildtype acylcoenzyme A: cholesterol
acyltransferase I specific probe.

Figures 8A, 8B, 8C and 8D. Tissue specific expression of
ARGP1 and hACAT.

8A and 8B. Multiple tissue Northerns (Clontech) with
indicated samples were probed with an
ARGP1 specific probe as described in the
text.

8C and 8D. The same blots were also analyzed using a
hACAT specific probe. The first panel is
identical to that published by Chang et al.
(8). The second panel is the same blot as
in A and B, probed with the ACAT cDNA 1600
bp probe.

Figure 9. Fetal tissue specific expression of ARGP2.

Multiple tissue Northerns of fetal tissue (Clontech)
with indicated samples were probed with an ARGP2
specific probe as described in the text.

Figure 10. Cultured cell expression of ARGP2.

RNA samples from HepG2 and CV1 were reverse
transcribed and PCR amplified as described in the
text. P indicates a plasmid template control. The
blank lanes represent water or no RT controls.

Figure 11. Sequence comparison of human ACAT and ARGP1.

Figure 12. Sequence comparison of human ACAT and ARGP2.

Figure 13. Phylogenetic comparisons of ACAT-like
molecules.

The sequences shown were identified in genome
databases and aligned based on protein sequence
using GCG Inc software (pileup). They were
subsequently arranged according to their sequence
conservation to determine approximate evolutionary
relatedness.

Figure 14. Conserved motifs in ACAT-related gene
products.

Figures 15A and 15B. Nucleotide and predicted protein
sequence of ARGP1.

Figure 16. Nucleotide and predicted protein sequence of
ARGP2.

Detailed Description of the Invention

Throughout this application, references to specific
nucleotides are to nucleotides present on the coding
strand of the nucleic acid. The following standard
abbreviations are used throughout the specification to
indicate specific nucleotides:

C=cytosine A=adenosine
T=thymidine G=guanosine

A "gene" means a nucleic acid molecule, the sequence of
which includes all the information required for the
normal regulated production of a particular protein,
including the structural coding sequence, promoters and
enhancers.

The nucleic acids or oligonucleotides of the subject
invention also include nucleic acids or oligonucleotides
coding for polypeptide analogs, fragments or derivatives
which differ from naturally-occurring forms in terms of
the identity or location of one or more amino acid
residues (deletion analogs containing less than all of
the residues specified for the protein, substitution
analogs wherein one or more residues specified are
replaced by other residues and addition analogs wherein
one or more amino acid residues is added to a terminal or
medial portion of the polypeptides) and which share some
or all properties of naturally-occurring forms. These
nucleic acids or oligonucleotides include: the
incorporation of codons "preferred" for expression by
selected non-mammalian hosts; the provision of sites for
cleavage by restriction endonuclease enzymes; and the
provision of additional initial, terminal or intermediate
DNA sequences that facilitate construction of readily
expressed vectors.



The nucleic acids and oligonucleotides described and 
claimed herein are useful for the information which they 
provide concerning the amino acid sequence of the 
polypeptide and as products for the large scale synthesis 
of the polypeptide by a variety of recombinant 
techniques. The molecule is useful for generating new 
cloning and expression vectors, transformed and 
transfected prokaryotic and eukaryotic host cells, and 
new and useful methods for cultured growth of such host 
cells capable of expression of the polypeptide and 
related products. 

This invention provides an isolated nucleic acid which
encodes an acylcoenzyme A: cholesterol acyltransferase
II. This isolated nucleic
acid may be DNA or RNA, specifically cDNA or genomic DNA.
Specifically, the isolated nucleic acid has the sequence
designated Seq. I.D. No.: 7. The isolated nucleic acid
encodes a human wildtype acylcoenzyme A: cholesterol
acyltransferase II having substantially the same amino
acid sequence as the sequence designated Seq. I.D. No.:
8. Specifically, the isolated nucleic acid has the
sequence designated Seq. I.D. No.: 11. The isolated
nucleic acid encodes a mouse wildtype acylcoenzyme A:
cholesterol acyltransferase II having substantially the
same amino acid sequence as the sequence designated Seq.
I.D. No.: 12. Further, the isolated nucleic acid
may encode a mutant acylcoenzyme A: cholesterol
acyltransferase II.

This invention also provides an isolated nucleic acid
which encodes an acylcoenzyme A: cholesterol
acyltransferase III. This isolated nucleic
acid may be DNA or RNA, specifically cDNA or genomic DNA.
Specifically, the isolated nucleic acid has the sequence
as set forth in Fig. 16. The isolated nucleic acid
encodes a human wildtype acylcoenzyme A: cholesterol
acyltransferase III having substantially the same amino
acid sequence as set forth in Fig. 16. Further, the
isolated nucleic acid may encode a mutant acylcoenzyme A:
cholesterol acyltransferase III.

As used in this application, "acylcoenzyme A: cholesterol
acyltransferase II" means and includes any polypeptide
having acylcoenzyme A: cholesterol acyltransferase II
activity and having an amino acid sequence homologous to
the amino acid sequence of human acylcoenzyme A:
cholesterol acyltransferase II (the sequence of which is
set forth in Fig. 15). Thus, this term includes any such
polypeptide whether naturally occurring and obtained by
purification from natural sources or non-naturally
occurring and obtained synthetically, e.g. by recombinant
DNA procedures. Moreover, the term includes any such
polypeptide whether its sequence is substantially the
same as, or identical to, the sequence of any mammalian
homolog of the human polypeptide, e.g. murine, bovine,
porcine, etc. homologs. Additionally, the term includes
mutants or other variants of any of the foregoing which
retain at least some of the enzymatic activity of
nonmutants or nonvariants.

As used in this application, "acylcoenzyme A: cholesterol
acyltransferase III" means and includes any polypeptide
having acylcoenzyme A: cholesterol acyltransferase III
activity and having an amino acid sequence homologous to
the amino acid sequence of human acylcoenzyme A:
cholesterol acyltransferase III (the sequence of which is
set forth in Fig. 16). Thus, this term includes any such
polypeptide whether naturally occurring and obtained by
purification from natural sources or non-naturally
occurring and obtained synthetically, e.g. by recombinant
DNA procedures. Moreover, the term includes any such
polypeptide whether its sequence is substantially the
same as, or identical to, the sequence of any mammalian
homolog of the human polypeptide, e.g. murine, bovine,
porcine, etc. homologs. Additionally, the term includes
mutants or other variants of any of the foregoing which
retain at least some of the enzymatic activity of
nonmutants or nonvariants.

The invention also encompasses DNAs and cDNAs which
encode amino acid sequences which differ from those of
acylcoenzyme A: cholesterol acyltransferase II, but which
do not produce phenotypic changes.

The invention also encompasses DNAs and cDNAs which
encode amino acid sequences which differ from those of
acylcoenzyme A: cholesterol acyltransferase III, but
which do not produce phenotypic changes.

The nucleic acids of the subject invention also include
nucleic acids that encode polypeptide analogs,
fragments or derivatives which differ from
naturally-occurring forms in terms of the identity or
location of one or more amino acid residues (including
deletion analogs containing less than all of the residues
specified for the protein, substitution analogs wherein
one or more residues specified are replaced by other
residues and addition analogs wherein one or more amino
acid residues is added to a terminal or medial portion of
the polypeptides) and which share some or all properties
of the naturally-occurring forms.
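
The three analog classes named above (deletion, substitution and addition) can be illustrated on a toy residue string. The following Python sketch is an illustration only: the function names and the eight-residue peptide are hypothetical and are not taken from the sequence listings, and no claim is made about which analogs retain enzymatic activity.

```python
def deletion_analog(seq: str, start: int, length: int) -> str:
    """Remove `length` residues beginning at 0-based index `start`."""
    if length < 1 or start < 0 or start + length > len(seq):
        raise ValueError("deleted span must lie inside the sequence")
    return seq[:start] + seq[start + length:]


def substitution_analog(seq: str, position: int, residue: str) -> str:
    """Replace the residue at 0-based `position` with a single other residue."""
    if not 0 <= position < len(seq) or len(residue) != 1:
        raise ValueError("position must exist and residue must be one letter")
    return seq[:position] + residue + seq[position + 1:]


def addition_analog(seq: str, position: int, residues: str) -> str:
    """Insert `residues` before 0-based `position`.

    position == 0 or len(seq) gives a terminal addition;
    anything in between gives a medial addition.
    """
    if not 0 <= position <= len(seq) or not residues:
        raise ValueError("position must be within the sequence bounds")
    return seq[:position] + residues + seq[position:]


peptide = "MKTAYIAK"  # hypothetical example peptide, one-letter codes

print(deletion_analog(peptide, 2, 3))              # MKIAK
print(substitution_analog(peptide, 3, "G"))        # MKTGYIAK
print(addition_analog(peptide, len(peptide), "GG"))  # MKTAYIAKGG (terminal)
print(addition_analog(peptide, 4, "GG"))           # MKTAGGYIAK (medial)
```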



The polypeptide of the subject invention also includes
analogs, fragments or derivatives which differ from
naturally-occurring forms, but which have acylcoenzyme A:
cholesterol acyltransferase activity.

This invention also provides a vector comprising an
isolated nucleic acid encoding acylcoenzyme A:
cholesterol acyltransferase II or III. The isolated
nucleic acid of the vectors is operatively linked to a
promoter of RNA transcription which may be, or is
identical to, a bacterial, yeast, insect or mammalian
promoter. The vector may be a plasmid, cosmid, yeast
artificial chromosome (YAC), bacteriophage or eukaryotic
viral DNA. Specifically, this invention provides a
vector designated YepAB-ACAT2 (Figure 6).

Further, numerous other vector backbones known in the art
as useful for expressing proteins may be employed. Such
vectors include, but are not limited to: adenovirus,
simian virus 40 (SV40), cytomegalovirus (CMV), mouse
mammary tumor virus (MMTV), Moloney murine leukemia
virus, murine sarcoma virus, and Rous sarcoma virus, DNA
delivery systems, i.e., liposomes, and expression plasmid
delivery systems.



This invention also provides a vector system for the
production of a polypeptide which comprises the vector in
a suitable host. Suitable hosts include, but are not
limited to, prokaryotic or eukaryotic
cells, e.g. bacterial cells (including gram positive
cells), yeast cells, fungal cells, insect cells and
animal cells.



Suitable animal cells include, but are not limited to,
HeLa cells, Cos cells, CV1 cells and various primary
mammalian cells. Numerous mammalian cells may be used as
hosts, including, but not limited to, the mouse
fibroblast cell NIH 3T3, CHO cells, Ltk- cells, etc.
Expression plasmids such as that described supra may be
used to transfect mammalian cells by methods well known
in the art such as calcium phosphate precipitation and
electroporation.



This invention also provides a method for producing a
polypeptide (e.g. acylcoenzyme A: cholesterol
acyltransferase) which comprises growing a host vector
system under suitable conditions permitting production of
the polypeptide and recovering the polypeptide so
produced. Methods of recovering polypeptides produced in
such host vector systems are well-known in the art and
typically include steps involving cell lysis,
solubilization and chromatography.

This invention also provides a method of obtaining a
polypeptide in purified form which comprises: (a)
introducing a vector, as described above, into a suitable
host cell; (b) culturing the resulting cell so as to
produce the polypeptide; (c) recovering the polypeptide
produced in step (b); and (d) purifying the polypeptide
so recovered. As discussed above, the vector may include
a plasmid, cosmid, yeast artificial chromosome,
bacteriophage or eukaryotic viral DNA. Also, the host
cell may be a bacterial cell (including gram positive
cells), yeast cell, fungal cell, insect cell or animal
cell. Suitable animal cells include, but are not
limited to, HeLa cells, Cos cells, CV1 cells and various
primary mammalian cells. Culturing methods useful for
permitting transformed or transfected host cells to
produce polypeptides are well known in the art, as are the
methods for recovering polypeptides from such cells and
for purifying them.

Using the aforementioned method, this invention also
provides a purified wildtype acylcoenzyme A: cholesterol
acyltransferase II or III and a purified mutant
acylcoenzyme A: cholesterol acyltransferase II or III.

This invention also provides an oligonucleotide of at
least 15 nucleotides capable of specifically hybridizing
with a unique sequence of nucleotides present within a
nucleic acid which encodes a wildtype acylcoenzyme A:
cholesterol acyltransferase II or III without hybridizing
to a nucleic acid which encodes a mutant acylcoenzyme A:
cholesterol acyltransferase II or III. Further, this
invention also provides an oligonucleotide of at least 15
nucleotides capable of specifically hybridizing with a
unique sequence of nucleotides present within the nucleic
acid which encodes a mutant acylcoenzyme A: cholesterol
acyltransferase II or III without hybridizing to a
nucleic acid which encodes a wildtype acylcoenzyme A:
cholesterol acyltransferase II or III. These
oligonucleotides may be DNA or RNA. Such oligonucleotides
may be used in accordance with well known standard
methods for known purposes, for example, to detect the
presence in a sample of DNA which will hybridize thereto.
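
The notion of an oligonucleotide of at least 15 nucleotides that is unique to one allele can be sketched computationally. The Python example below is a simplified illustration, not the method of this specification: it treats an exact sequence match on either strand as a stand-in for hybridization, ignoring mismatch tolerance, melting temperature and stringency. The function names and the toy sequences are hypothetical.

```python
from typing import List

_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(dna: str) -> str:
    """Return the reverse complement of a DNA string (A<->T, C<->G)."""
    return dna.translate(_COMPLEMENT)[::-1]


def allele_specific_oligos(target: str, other: str, k: int = 15) -> List[str]:
    """List k-mers of `target` that occur on neither strand of `other`.

    Assumption: exact matching approximates hybridization. A k-mer that
    appears in `other` or in its reverse complement is rejected.
    """
    if k < 15:
        raise ValueError("the text calls for at least 15 nucleotides")
    target, other = target.upper(), other.upper()
    # "N" keeps a window from matching across the junction of the two strands.
    searched = other + "N" + reverse_complement(other)
    found: List[str] = []
    for i in range(len(target) - k + 1):
        window = target[i:i + k]
        if window not in searched and window not in found:
            found.append(window)
    return found


# Hypothetical toy sequences: the "mutant" differs by one base at index 16.
wildtype = "ATGGCTAGCTAGGATCCGATCGTACGTTAGCA"
mutant = wildtype[:16] + "T" + wildtype[17:]

for oligo in allele_specific_oligos(wildtype, mutant):
    print(oligo)
```

Swapping the two arguments lists candidate oligonucleotides specific to the mutant sequence instead of the wildtype sequence.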

The oligonucleotides include, but are not limited to,
oligonucleotides that hybridize to mRNA encoding
acylcoenzyme A: cholesterol acyltransferase II or III so
as to prevent translation of the protein.

This invention also provides a nucleic acid having a
sequence complementary to the sequence of the isolated
nucleic acid which encodes acylcoenzyme A: cholesterol
acyltransferase II or III.

This invention also provides a method for determining
whether a subject known to have an imbalance in sterol
levels has the imbalance due to a defect in
esterification of sterol, which comprises (a) obtaining
from the subject an appropriate sample containing a
mixture of all of the subject's nucleic acids; and (b)
determining whether any nucleic acid in the sample from
step (a) is, or is derived from, a nucleic acid which
encodes a mutant acylcoenzyme A: cholesterol
acyltransferase, so as to thereby determine whether the
subject's imbalance in sterol levels is due to a defect
in esterification of sterol. The determination step (b)
may comprise: (i) contacting the sample of step (a) with
the isolated nucleic acid which encodes acylcoenzyme A:
cholesterol acyltransferase II or III or the
oligonucleotide of at least 15 nucleotides capable of
specifically hybridizing with a unique sequence of
nucleotides present within a nucleic acid which encodes
a wildtype acylcoenzyme A: cholesterol acyltransferase II
or III without hybridizing to a nucleic acid which
encodes a mutant acylcoenzyme A: cholesterol
acyltransferase II or III, under conditions permitting
binding of any nucleic acid in the sample which is, or is
derived from, a nucleic acid which encodes a mutant
acylcoenzyme A: cholesterol acyltransferase to the
nucleic acid or oligonucleotide so as to form a complex;
(ii) isolating the complex so formed; and (iii)
identifying the nucleic acid in the isolated complex so
as to thereby determine whether any nucleic acid in the
sample contains a nucleic acid which is, or is derived
from, a nucleic acid which encodes a mutant acylcoenzyme
A: cholesterol acyltransferase II or III. In this
method, both the isolation of any complex formed and the
identification of the nucleic acid in the complex are
effected using standard methods well known in the art.



In order to facilitate identification of the nucleic acid
from step (a), the isolated nucleic acid or the
oligonucleotide is labeled with a detectable marker. The
detectable marker may be a radioactive isotope, a
fluorophore or an enzyme. In addition, the nucleic acid
sample may be bound to a solid matrix before performing
step (i).

This invention also provides a method for treating a
subject who has an imbalance in sterol levels due to a
defect in esterification of sterol which comprises
introducing an isolated nucleic acid which encodes a
wildtype acylcoenzyme A: cholesterol acyltransferase II
or III into the subject under conditions such that the
nucleic acid expresses a wildtype acylcoenzyme A:
cholesterol acyltransferase II or III, so as to thereby
treat the subject.

This invention also provides a method for inhibiting
wildtype acylcoenzyme A: cholesterol acyltransferase II
or III in a subject which comprises transforming
appropriate cells from the subject with a vector which
expresses the nucleic acid complementary to the isolated
nucleic acid which encodes a wildtype acylcoenzyme A:
cholesterol acyltransferase II or III, and introducing
the transformed cells into the subject so as to thereby
inhibit wildtype acylcoenzyme A: cholesterol
acyltransferase II or III. Further, in a preferred
embodiment, the nucleic acid is capable of specifically
hybridizing to an mRNA molecule encoding acylcoenzyme A:
cholesterol acyltransferase II or III so as to prevent
translation of the mRNA molecule.

This invention also provides a method for inhibiting the
wildtype acylcoenzyme A: cholesterol acyltransferase II
or III in a subject which comprises introducing an
oligonucleotide of at least 15 nucleotides capable of
specifically hybridizing with a unique sequence of
nucleotides present within a nucleic acid which encodes
a wildtype acylcoenzyme A: cholesterol acyltransferase II
or III without hybridizing to a nucleic acid which
encodes a mutant acylcoenzyme A: cholesterol
acyltransferase II or III into the subject so as to
thereby inhibit the wildtype acylcoenzyme A: cholesterol
acyltransferase II or III. The oligonucleotide is
capable of specifically hybridizing to an mRNA molecule
encoding acylcoenzyme A: cholesterol acyltransferase II
or III so as to prevent translation of the mRNA molecule.

This invention also provides for a method for identifying
a chemical compound which is capable of inhibiting
acylcoenzyme A: cholesterol acyltransferase II or III in
a subject which comprises (a) contacting a wildtype
acylcoenzyme A: cholesterol acyltransferase II or III
with the chemical compound under conditions permitting
binding between the acylcoenzyme and the chemical
compound; (b) detecting specific binding of the chemical
compound to the acylcoenzyme; and (c) determining whether
the chemical compound inhibits the activity of the
coenzyme so as to identify a chemical compound which is
capable of inhibiting acylcoenzyme A: cholesterol
acyltransferase II or III in a subject.

This invention also provides a method for differentially
inhibiting one acylcoenzyme A: cholesterol
acyltransferase but not others using the above methods.
In an embodiment, only acylcoenzyme A: cholesterol
acyltransferase I is inhibited. In another embodiment,
only acylcoenzyme A: cholesterol acyltransferase II
(ARGP1) is inhibited. In another embodiment, only
acylcoenzyme A: cholesterol acyltransferase III (ARGP2)
is inhibited. Alternatively, two of the acylcoenzyme A:
cholesterol acyltransferases may be inhibited. This
invention further provides pharmaceutical compositions
which will differentially inhibit one or more
acylcoenzyme A: cholesterol acyltransferases.

This invention also provides for a pharmaceutical
composition comprising the chemical compound identified
by the above-described method in an amount effective to
inhibit acylcoenzyme A: cholesterol acyltransferase II or
III in a subject and a pharmaceutically effective
carrier.

This invention also provides a method of treating a
subject who has atherosclerosis comprising administering
the above-described pharmaceutical composition. This
invention also provides a method of treating a subject
who has hyperlipidemia comprising administering the
above-described pharmaceutical composition.

This invention also provides a transgenic, nonhuman
mammal comprising the isolated nucleic acid which encodes
acylcoenzyme A: cholesterol acyltransferase II or III.
The mammal includes, but is not limited to, a mouse,
bovine, cat or dog.


This invention is illustrated in the Experimental Details
section which follows. This section is set forth to
aid in an understanding of the invention but is not
intended to, and should not be construed to, limit in any
way the invention as set forth in the claims which follow
thereafter.

Experimental Details

First Series of Experiments 

Example 1:

Materials and Methods:

Transformation of yeast was performed with lithium
acetate (15) by amino-acid prototrophy selection. A
diploid strain (5051) was constructed between two
isogenic derivatives of W303 (16): W1346-3C (MATa,
ade2-1, can1-100, his3-11,15, leu2-3,112, trp1-1,
ura3-1) and W1134-2C (MATa, can1-100, his3-11,15,
leu2-3,112, trp1-1, ura3-1, met14ΔHpaI-SalI). Growth on
complete (YEPD) or synthetic medium, sporulation and
dissection were performed as described (17).

Competent cells of Escherichia coli strain DH5α
(Gibco-BRL) and DNA modifying enzymes (Promega) were used
according to the manufacturers' instructions. pH3(34),
from L.A. Grivell, was digested with Nhe I, blunt-ended
with Klenow, and digested with Avr II to
liberate a 1614-bp fragment. An Xba I, Sma I fragment of
pJH-Hl encoding the HIS3 gene was then inserted at these
sites in the vector backbone to produce the are1ΔNA
allele. This construct was digested with Bsa I to
liberate a 3821-bp fragment which was then transformed
into strain 5051. Disruption of ARE1 was confirmed by
Southern blot analysis.

Radioactive probes of acylcoenzyme A: cholesterol
acyltransferase-related enzyme I were prepared by random
priming (Pharmacia) with 32P-dCTP. Genomic DNA (18) was
transferred to Hybond membranes (Amersham) and hybridized
in the absence of formamide at 65°C or 60°C (19).

A shotgun library of cosmid 14-21 from chromosome XIV
(Peter Philippsen, Biozentrum Basel) was constructed
using the nebulizing technique (20). The DNA was
nebulized (90 seconds, 2 bars), size fractionated,
treated with DNA polymerase I (Klenow fragment) and T4
DNA polymerase and blunt-end ligated into pTZ18R
(Pharmacia, Germany). Nucleotide sequencing was
performed by dideoxy-chain-termination with
digoxigenin-labeled reverse primer and Sequenase (United
States Biochemical). The reactions were analyzed on the
GATC 1500 direct blotting electrophoresis system (GATC
GmbH, Germany) using the Boehringer-Mannheim
Dig-development protocol. Sequences were aligned by
SeqMan (DNA Star Inc.). Database searching was performed
with BLAST (21) and GCG Inc. software (22). The DNA
sequences of the acylcoenzyme A: cholesterol
acyltransferase-related enzyme I and acylcoenzyme A:
cholesterol acyltransferase-related enzyme II genes are
deposited at GenBank (P25628 and U51790, respectively).

KO-5' and KO-3' primers
(GAGGGGACGAAAATTAGCCGCTATTAATTCTGGTATTGCCACCTAGACAAGAAG
TAAACAGACACAGATGcaagagttcgaatctcttagc (Sequence ID No.:
15) and CTATAAAGATTTAATAGCTCCACAGAACAGTTGCAGGATGCCTTAGGGT
CGActacgtcgtaaggccgtttctgac (Sequence ID No.: 16),
respectively; lower case corresponds to the LEU2 gene)
were used in a PCR with the LEU2 gene as a template to
produce the selectable yeast gene flanked by acylcoenzyme
A: cholesterol acyltransferase-related enzyme II gene
sequences (23). This was used to transform a derivative
of yeast strain 5051, heterozygous for the are1ΔNA
allele. To identify integrants at the acylcoenzyme A:
cholesterol acyltransferase-related enzyme II locus, a
PCR was performed on genomic DNA from these strains using
are2-5' (CATTGCAGTTACACGTGAATGC) (Sequence ID No.: 17),
are2-3' (TAGCTCCACAGAACAGTTGCAGG) (Sequence ID No.: 18)
and a 3' amplimer corresponding to the LEU2 gene (L2-3':
CTCTGACAACAACGAAGTCAG) (Sequence ID No.: 19).
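
The two-part design of the knock-out primers can be made
explicit with a short Python sketch. The primer sequences
are copied from the text above; the function and variable
names are illustrative assumptions. The sketch separates
each primer into its upper-case 5' portion (the flanking
acylcoenzyme A: cholesterol acyltransferase-related enzyme
II sequence) and its lower-case 3' portion (the part that
corresponds to LEU2).

```python
import re

# Primer sequences as given in the text (Sequence ID Nos. 15 and 16).
KO_5 = ("GAGGGGACGAAAATTAGCCGCTATTAATTCTGGTATTGCCACCTAGACAAGAAG"
        "TAAACAGACACAGATGcaagagttcgaatctcttagc")
KO_3 = ("CTATAAAGATTTAATAGCTCCACAGAACAGTTGCAGGATGCCTTAGGGT"
        "CGActacgtcgtaaggccgtttctgac")


def split_primer(primer: str) -> tuple[str, str]:
    """Split a primer into its 5' upper-case part and 3' lower-case part."""
    match = re.fullmatch(r"([ACGT]+)([acgt]+)", primer)
    if match is None:
        raise ValueError("expected upper-case bases followed by lower-case bases")
    return match.group(1), match.group(2)


FLANK_5, LEU2_5 = split_primer(KO_5)
FLANK_3, LEU2_3 = split_primer(KO_3)
```

After amplification, the upper-case portions end up at the
two ends of the LEU2 product, where they can direct
homologous recombination at the target locus.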

Cells (1-2 units of absorbance at 600 nm) were
incubated in YEPD or defined media containing 1 µCi/ml
3H-oleate in tyloxapol/ethanol (1:1) for 16 hours. Total
lipids were prepared by hexane extraction (25) and
analysed by thin layer chromatography on DC-Plastikfolien
Kieselgel 60 plates (E. Merck, Germany). The plate was
developed in hexane, diethyl ether and acetic acid
(70:30:1) and stained with iodine vapor. Incorporation
of label into triglyceride and ergosterol ester was
ascertained following scintillation counting and
normalization to a 14C-cholesterol internal standard and
the dry weight of the cells.
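
One plausible way to carry out such a normalization is
sketched below in Python. The formula is an assumption, not
a procedure stated in this application: it supposes that the
internal standard corrects for losses during extraction and
chromatography and that the result is expressed per
milligram of dry cells. The function name and all counts are
hypothetical.

```python
def normalized_incorporation(sample_dpm: float,
                             standard_recovered_dpm: float,
                             standard_added_dpm: float,
                             dry_weight_mg: float) -> float:
    """Return label incorporation in dpm per mg dry weight, corrected for recovery."""
    if min(standard_added_dpm, standard_recovered_dpm, dry_weight_mg) <= 0:
        raise ValueError("standard counts and dry weight must be positive")
    # Fraction of the internal standard that survived extraction and TLC.
    recovery = standard_recovered_dpm / standard_added_dpm
    return sample_dpm / recovery / dry_weight_mg


# Hypothetical counts: 8000 dpm in the sterol ester band, 80% recovery of
# the internal standard, 5 mg dry weight of cells.
EXAMPLE = normalized_incorporation(8000.0, 4000.0, 5000.0, 5.0)
```

With these hypothetical inputs the corrected value is 2000
dpm per mg, which can then be compared across strains.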

To overexpress the acylcoenzyme A: cholesterol
acyltransferase-related enzyme I gene by copy number
under the control of its own promoter in YEp3-16, a 2354
bp Cla I fragment from pH3(34), encompassing the entire
acylcoenzyme A: cholesterol acyltransferase-related
enzyme I gene, was made blunt-ended with Klenow DNA
polymerase I and introduced into the Sma I site of
YEp352. To constitutively overexpress acylcoenzyme A:
cholesterol acyltransferase-related enzyme I from the ADH
promoter in pADH5-36, a 2290 bp Nar I fragment of
pH3(34), starting 70 bp 5' to the ORF, was blunt-ended
with Klenow and ligated to Klenow-treated, Eco RI
digested, pDC-ADH (a derivative of pS5) (26). Increased
expression of the acylcoenzyme A: cholesterol
acyltransferase-related enzyme I transcripts, relative to
a wild-type cell, was confirmed by northern blot
analysis.

The incorporation of [1-14C]acetate into saponified
lipids was assessed as a measurement of sterol synthesis.
Approximately 2 OD600 units of cells were incubated with
20 µCi [1-14C]acetate in 2 ml defined media at 30°C for
3 hours and subjected to lipid saponification, hexane
extraction and thin layer chromatography (29). The
incorporation of counts into total sterols was assessed
following scintillation counting. To normalize the
estimate of sterol biosynthesis to incorporation of
acetate into the fatty acid pool, the aqueous lysate
remaining after hexane extraction was acidified with
concentrated HCl and re-extracted with hexane (30).


Experimental Discussion 

To use yeast genetics to study sterol esterification, the
human acylcoenzyme A: cholesterol acyltransferase
sequence was used to search for homologous yeast genes
and subsequently to identify an additional human isoform
(Figures 1A and 1B). Acylcoenzyme A: cholesterol
acyltransferase-related enzyme I, an 1830-bp open reading
frame (ORF) on yeast chromosome III, encodes a
610-residue protein with 23% identity and 49% similarity to
human acylcoenzyme A: cholesterol acyltransferase I
(Figures 1A and 1B). The yeast and human proteins
possess leucine zipper motifs that could mediate
protein-protein interactions (esterification is probably
performed by a multimeric complex) (12), and possess at
least two predicted transmembrane domains that may
mediate the membrane association of the acylcoenzyme A:
cholesterol acyltransferase reaction (13, 14).

To define the role of acylcoenzyme A: cholesterol
acyltransferase-related enzyme I in sterol
esterification, the deletion mutant, are1ΔNA, was
generated by homologous recombination (15, 16, 17) (Fig.
2A). In a diploid strain, a 1614-bp segment of one
acylcoenzyme A: cholesterol acyltransferase-related
enzyme I allele was replaced with the HIS3 gene, and the
replacement was confirmed by Southern hybridization
(Fig. 2B). Analysis of mutant and wild-type haploid
progeny from this diploid indicated no differences in
growth rates or incorporation of 3H-oleate into
ergosterol ester.



The lack of a defect in sterol esterification in are1ΔNA
strains could result from alternate esterification
activities. Reduced stringency hybridization of yeast
genomic DNA with the acylcoenzyme A: cholesterol
acyltransferase-related enzyme I coding sequence as a
probe indicated that additional homologous sequences were
present (18, 19). A Bam HI digestion of genomic DNA
produced the predicted 2.9-kb acylcoenzyme A: cholesterol
acyltransferase-related enzyme I fragment and a ~6.0-kb
hybridizing fragment (Fig. 2C). Contour clamped
homogeneous electric field electrophoretic analysis of
yeast chromosomes suggested the latter sequence was
localized to chromosome X or XIV. On the basis of
homology to acylcoenzyme A: cholesterol
acyltransferase-related enzyme I, this gene, designated
acylcoenzyme A: cholesterol acyltransferase-related
enzyme II, encodes a second yeast homolog to human
acylcoenzyme A: cholesterol acyltransferase I (Figures 1A
and 1B). The genomic sequence (20, 21, 22) encompassing
acylcoenzyme A: cholesterol acyltransferase-related
enzyme II on chromosome XIV predicts a 5997-bp Bam HI
fragment and a 1929-bp ORF, which translates into a
643-residue polypeptide. The yeast acylcoenzyme A:
cholesterol acyltransferase-related enzyme genes are 61%
and 49% identical at the DNA and predicted protein levels,
respectively. Are1p, Are2p and the human acylcoenzyme A:
cholesterol acyltransferase I protein are most related at
the COOH-terminal region (42% identity over a 90-residue
sequence) (Figures 1A and 1B).
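
The ORF lengths and residue counts quoted above are
consistent with simple division by three, as the following
Python check illustrates. The function name is an
illustrative assumption, and the text does not state whether
the stop codon is included in the quoted ORF lengths; the
sketch simply counts codons.

```python
def codons_in_orf(orf_bp: int) -> int:
    """Number of codons in an ORF of `orf_bp` base pairs (three bases per codon)."""
    if orf_bp <= 0 or orf_bp % 3 != 0:
        raise ValueError("an ORF length must be a positive multiple of three")
    return orf_bp // 3


# ORF lengths taken from the text for the two yeast genes.
ARE1_CODONS = codons_in_orf(1830)
ARE2_CODONS = codons_in_orf(1929)
```

The results, 610 and 643, agree with the residue counts
given for the two predicted proteins.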

To assess the contribution of Are2p to sterol
esterification, one copy of the acylcoenzyme A:
cholesterol acyltransferase-related enzyme II coding
sequence was deleted from the genome of an ARE1/are1ΔNA
heterozygous diploid by a polymerase chain reaction
approach (23) (Fig. 2D). Haploid progeny representing
the single are1ΔNA and are2Δ deletions and the
are1ΔNAare2Δ double mutant were obtained. To ascertain the
effect of deletion of acylcoenzyme A: cholesterol
acyltransferase-related enzyme genes upon cytoplasmic
lipid storage, the neutral lipid components (triglyceride
and sterol ester) of the yeast cells were detected by
fluorescence microscopy after staining with Nile Red
(24). In wild-type cells, cytoplasmic fluorescent
droplets accumulated in stationary phase cultures (Fig.
3A). No differences in are single mutants were detected.
However, the number of droplets observed in are1ΔNAare2Δ
double mutants was one-third of that in wild-type
strains (Fig. 3B; over multiple fields, 5.57 ± 2.73 vs.
16.73 ± 4.6 droplets/cell, P<0.05).
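
The "one-third" figure follows directly from the two means
reported above, as this small Python calculation shows. Only
the two mean values come from the text; the variable names
are illustrative, and no statistical test is attempted
because the number of cells counted is not given here.

```python
# Mean droplet counts per cell, as reported in the text.
DOUBLE_MUTANT_MEAN = 5.57
WILD_TYPE_MEAN = 16.73

# Fraction of the wild-type droplet count retained by the double mutant.
RATIO = DOUBLE_MUTANT_MEAN / WILD_TYPE_MEAN
```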

The wild-type and are mutant cells were analyzed for the
incorporation of 3H-oleate into sterol ester (25) (Fig.
4B). No significant differences in triglyceride
biosynthesis were detected. In contrast to normal sterol
ester biosynthesis observed in are1ΔNA mutants,
deficiencies in sterol esterification were apparent in
both are2Δ and are1ΔNAare2Δ mutants. These were detected
by iodine vapor staining of thin layer chromatographs of
total yeast lipids in addition to the oleate
incorporation assays. Sterol ester levels of are2Δ
single mutants were reduced to less than 26% of wild-type,
suggesting that the acylcoenzyme A: cholesterol
acyltransferase-related enzyme II isoform confers the
majority of acyltransferase activity. The are1ΔNAare2Δ
double mutant was almost totally deficient in sterol
esterification (less than 1% of wild-type levels). In
confirmation of the critical role of Are proteins in
sterol esterification, microsomes from double mutant
yeast cells lacked acylcoenzyme A: cholesterol
acyltransferase activity when assayed in vitro.



To confirm that the protein encoded by an acylcoenzyme A:
cholesterol acyltransferase-related enzyme ORF was
sufficient for sterol esterification, the acylcoenzyme A:
cholesterol acyltransferase-related enzyme I coding
sequence was over-expressed in vectors with increased
copy number (YEp3-16) or elevated transcription (the
alcohol dehydrogenase promoter in pADH5-36) (26). There
were no detectable changes in triglyceride or
phospholipid biosynthesis resulting from acylcoenzyme A:
cholesterol acyltransferase-related enzyme I
over-expression. In are2Δ or are1ΔNAare2Δ double
mutants, acylcoenzyme A: cholesterol
acyltransferase-related enzyme I over-expression
complemented the sterol esterification defect (Fig. 4C).
In wild-type and are1ΔNA single mutants, the high level
expression of acylcoenzyme A: cholesterol
acyltransferase-related enzyme I did not elevate sterol
ester synthesis above untransformed controls. This
suggests that either substrates are limiting in
acylcoenzyme A: cholesterol acyltransferase-related
enzyme strains or that the enzyme is post-translationally
regulated as in mammalian cells (27).

An accumulation of unesterified sterol in cell membranes
would likely be deleterious (28). However, despite the
major changes in sterol esterification conferred by the
are mutants, we did not detect any reduction in growth
rates. The established role of sterol esterification in
the storage of sterol suggests that an inability to
esterify sterol could lead to homeostatic changes in
sterol biosynthesis. This relationship might account for
the viability of the mutants. Total lipids, labelled by
the incorporation of [1-14C]acetate into exponentially
growing cells (29, 30), were saponified and extracted.
The are1ΔNAare2Δ double mutants had a two- to three-fold
lower level of sterol biosynthesis than wild-type cells,
although no changes were observed in the single mutants
(Fig. 4D). In fact, free sterol concentrations were
roughly equivalent in all cells. Feedback regulation of
sterol biosynthesis by acylcoenzyme A: cholesterol
acyltransferase activity has been observed in mammalian
cells (31) and may be a common mechanism that maintains
intracellular sterol at non-toxic concentrations.


The involvement of multiple gene families in sterol
homeostasis is common in mammalian and yeast cells (for
example, the LDL receptor related protein and scavenger
receptor gene families, the SREBP family, and
3-hydroxy-3-methylglutaryl-CoA reductase) (4, 32, 33, 34).
This apparent redundancy of function has clear
physiological consequences as evidenced by deletion of any
one of the family members. The observation here of two
yeast genes for sterol esterification provoked the
hypothesis of similar redundancy for this reaction in
humans. To this end, a consensus of the yeast acylcoenzyme
A: cholesterol acyltransferase-related enzymes and human
acylcoenzyme A: cholesterol acyltransferase I sequences was
used to identify an additional cDNA with significant
identity (47%) to human acylcoenzyme A: cholesterol
acyltransferase I and the yeast proteins (Figure 1B,
GenBank accession # R07932).

Sterol homeostasis is a complex event under subtle
regulatory controls, one component of which is sterol
esterification. The demonstration here of multiple yeast
and human acylcoenzyme A: cholesterol acyltransferase
isoforms raises the possibility that, in vivo, the enzymes
exhibit alternate substrate preferences. The analysis of
esterification reactions in yeast is likely to impact the
understanding of sterol homeostasis and atherosclerosis
in humans.

Example 2:

Tissue specific expression of acylcoenzyme A: cholesterol
acyltransferase II was analyzed by Northern blot
hybridization of RNA obtained from the described tissues.
Using the same materials and procedures as Chang et al.
(10), the specific expression of acylcoenzyme A:
cholesterol acyltransferase II in liver and muscle is
documented, in contrast to similar experiments using the
previously known acylcoenzyme A: cholesterol
acyltransferase I (10) (Figures 7A and 7B). Acylcoenzyme
A: cholesterol acyltransferase II was also detected and
specifically expressed in adrenal, thyroid and testicular
tissues.

Example 3:

After determining the consensus sequence between the two
yeast genes and the previously known human acylcoenzyme A:
cholesterol acyltransferase, the consensus sequence was
compared to sequences deposited in GenBank. The clones
containing the sequences that showed similarity to the
consensus sequence were ordered from the I.M.A.G.E.
Consortium, affiliated with Research Genetics, Inc., 2130
Memorial Parkway S.W., Huntsville, Alabama 35801. Clones
deposited with the I.M.A.G.E. Consortium are publicly
available upon request. A particular clone, GenBank ID
clone No. Z39933, was chosen. This clone contains a cDNA
fragment whose sequence encodes human acylcoenzyme A:
cholesterol acyltransferase II. The fragment was cut out
with restriction enzymes Bgl II and Not I. The
resulting fragment was introduced into the yeast
expression vector pRS426 at Bgl II and Not I sites
downstream of the yeast promoter (GAL1/GAL10), which is
regulated by carbon sources. The resultant vector was
designated YepAB-ACAT2 (Figure 6).

Example 4:

Antisense RNA technology can be used to create mice, or
mouse or human cell lines, incapable of translating
acylcoenzyme A: cholesterol acyltransferase II RNA into
protein. Standard methods may be used to create an
antisense oligonucleotide to the human homolog of
acylcoenzyme A: cholesterol acyltransferase II. These
methods are well known in the art (36).

Specifically, part or all of a wildtype acylcoenzyme A:
cholesterol acyltransferase II is ligated adjacent to a
mammalian promoter in the opposite orientation. The
promoter and other replicatory mechanisms inside the cell
will transcribe the non-coding (antisense) strand of the
human homolog of acylcoenzyme A: cholesterol
acyltransferase II. This strand will bind with the coding
mRNA which is normally synthesized to form a complex. Due
to the formation of this complex, the antisense strand
prevents the translation of the coding mRNA into protein.
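
The base-pairing relationship between a coding mRNA and its
antisense strand can be sketched in Python as follows. The
15-nucleotide target is a hypothetical example, not a
sequence from this application, and the function names are
illustrative. The sketch only computes the reverse
complement; it says nothing about delivery, stability or
efficacy in a cell.

```python
COMPLEMENT = str.maketrans("ACGU", "UGCA")


def antisense_rna(mrna: str) -> str:
    """Return the antisense RNA (reverse complement), written 5' to 3'."""
    mrna = mrna.upper()
    if set(mrna) - set("ACGU"):
        raise ValueError("mRNA may contain only A, C, G and U")
    return mrna.translate(COMPLEMENT)[::-1]


def is_fully_paired(mrna: str, antisense: str) -> bool:
    """True if `antisense` pairs with every base of `mrna` in antiparallel fashion."""
    return antisense_rna(mrna) == antisense.upper()


# Hypothetical 15-nt stretch of coding mRNA.
TARGET = "AUGGCUAGCUUGACC"
ANTISENSE = antisense_rna(TARGET)
```

Applying the function twice returns the original sequence,
reflecting the fact that the two strands are mutually
complementary.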

Further, one skilled in the art can synthesize an
oligonucleotide in vitro which is capable of binding the
mRNA that encodes a human homolog of acylcoenzyme A:
cholesterol acyltransferase II so as to inhibit the
translation of the mRNA into protein. The
oligonucleotides can then be introduced into the subject
using a pharmaceutically acceptable carrier. Methods of
synthesizing naturally and non-naturally occurring
oligonucleotides which are capable of inhibiting the
translation of the mRNA into protein are well known in
the art. Also, means of transfecting an organism with
such oligonucleotides are well known in the field.

Example 5: 

Mice can be made with an alteration in their genome,
specifically at the acylcoenzyme A: cholesterol
acyltransferase II gene site. Standard methods may be
used to alter the genome. These methods are well known
in the art (37, 38).

One such process to achieve this goal involves disrupting
the wildtype mouse homolog of acylcoenzyme A: cholesterol
acyltransferase II in vitro, then introducing the altered
gene into mouse embryonal stem cells in such a way as to
target integration into the corresponding genomic region.
This process can be performed such that both copies of
the wildtype acylcoenzyme A: cholesterol acyltransferase
II are replaced by the altered, knock-out version. These
modified cells can be introduced into blastocysts which
will be allowed to develop into chimeric adults. Mice
bearing the altered acylcoenzyme A: cholesterol
acyltransferase II gene will be mated to each other to
generate homozygous mutant acylcoenzyme A: cholesterol
acyltransferase II animals.

Further, one can breed two mice that are heterozygous for
mutant acylcoenzyme A: cholesterol acyltransferase II.
From their progeny, one skilled in the art could select
the progeny that are homozygous for mutant acylcoenzyme A:
cholesterol acyltransferase II. Breeding and selecting
such progeny are well known in the art.
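
The expected outcome of such a heterozygote cross can be
sketched in Python. The sketch assumes simple Mendelian
segregation of a single locus and equal viability of all
genotypes, neither of which is asserted by this
application; the allele symbols and function name are
illustrative.

```python
from collections import Counter
from fractions import Fraction
from itertools import product


def cross(parent_a: str, parent_b: str) -> dict[str, Fraction]:
    """Expected genotype frequencies among progeny of two diploid parents.

    Each parent is a two-character string of alleles, e.g. "+-" for a
    heterozygote carrying one wildtype (+) and one knock-out (-) allele.
    """
    # Combine every allele of one parent with every allele of the other.
    counts = Counter("".join(sorted(pair)) for pair in product(parent_a, parent_b))
    total = sum(counts.values())
    return {genotype: Fraction(n, total) for genotype, n in counts.items()}


PROGENY = cross("+-", "+-")
```

Under these assumptions one quarter of the progeny are
expected to be homozygous for the mutant allele, which is
the class selected in the breeding scheme described above.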

Second Series of Experiments

The efficient regulation of intracellular sterol levels
is required for cell viability by all eukaryotic
organisms. When this regulation is aberrant in cells of
the arterial wall, disease states such as atherosclerosis
ensue. A critical component of this homeostasis is the
intracellular sterol esterification reaction, mediated by
the enzyme acyl coenzyme A-cholesterol acyltransferase
(ACAT). In the model eukaryote, yeast, this laboratory
has demonstrated that sterol esterification is mediated
by a two gene family (Yang et al., Science, 1996,
272:1353). The existence in human cells of two
additional genes encoding ACAT related enzymes is
demonstrated. These proteins are termed ACAT related
gene products (ARGP) 1 and 2, also known as acylcoenzyme
A: cholesterol acyltransferase II and acylcoenzyme A:
cholesterol acyltransferase III, respectively. The ARGPs
exhibit marked sequence conservation to the human ACAT
sequence (hACAT) originally identified by Chang and
colleagues. ARGP1 is expressed at high levels in
intestine and liver, in contrast to the expression of
hACAT, which is of low abundance in these tissues. The
observation that knock-out mutant mice deficient in the
murine homolog of hACAT retain sterol esterification
activity in liver and intestine (Meiner et al., PNAS,
1996, 93:14041) suggests that ARGP1 is a candidate for
sterol esterification in these tissues. The expression
of ARGP2, by contrast, seems to be restricted to the
fetal liver, suggesting it to have a role in lipid
metabolism during development. Analysis of genome
databases indicates that ACAT-like gene families are a
common occurrence in multiple organisms. It is
hypothesized that multiple enzymes for sterol
esterification will provide flexibility in response to
differing sterol and fatty acid substrates encountered by
different tissues. This further suggests specific roles
for these enzymes in lipoprotein production, lipid
homeostasis, and disease progression.

The regulation of membrane sterol levels is required for
cell viability by all eukaryotic organisms. When this
regulation is aberrant in human cells, disease states
such as atherosclerosis (excessive accumulation of
cellular esterified cholesterol in cells of the arterial
wall, reviewed in (1-4)), Niemann Pick C (inability to
store sterol correctly, resulting in lysosomal lipidosis,
(5)) or Wolman's disease (a defect in sterol ester
hydrolysis, (6)) ensue. A critical component of this
homeostasis is the intracellular neutralization of sterol
by an esterification reaction between the C3-OH group of
cholesterol and fatty acyl-coenzyme A. This reaction is
performed in mammalian cells by the enzyme acyl coenzyme
A-cholesterol acyltransferase (ACAT). Since the process
of sterol esterification converts sterol into a
cytoplasmic storage form, it is critical to all
eukaryotes, including the microorganism Saccharomyces
cerevisiae (budding yeast). Analysis of sterol
homeostasis in this model organism has the advantage that
molecular genetics, particularly since the completion of
the yeast genome sequencing project, is powerful and
relatively straightforward. Taking advantage of this, it
is demonstrated that sterol esterification in yeast is
mediated by a two gene family (7), neither member of which
is essential for life. These genes (ARE1 and ARE2,
encoding ACAT Related Enzymes 1 and 2, respectively) are
both capable of independently esterifying sterol,
although in terms of contribution to the sterol ester
mass of the cell, Are1 is a minor isoform relative to
Are2. The genes are structurally and functionally
analogous to the ACAT sequence isolated originally from
macrophages by Chang and colleagues (8). They share
approximately 23% identity at the protein level, and
expression of the human macrophage ACAT cDNA in yeast are
double deletion mutants results in esterification of
sterol (9).



A critical test of the role of the ACAT gene product in
cholesterol homeostasis and atherosclerosis was initiated
by Farese and colleagues, by the production of
"knock-outs" at the Acact locus corresponding to the mouse
homolog of hACAT (10). The fidelity of the mutation was
confirmed by sequencing of cDNA from the disrupted allele
and by the failure to detect immunoreactive protein in
Acact-/- cell extracts. The animals were healthy and
fertile and had residual, but significant, sterol
esterification activity in fibroblasts and macrophages.
Cholesterol ester levels and ACAT activity in the
adrenals were also severely reduced. Conversely, Acact-/-
livers contained significant levels of cholesterol ester,
and esterification activity was not altered.
Furthermore, sterol absorption in the intestine, a
process that probably requires esterification, was
unaffected by the gene disruption. These observations
strongly suggest that, as in yeast, there are multiple
genes for the ACAT reaction in mammalian cells, probably
with tissue specific expression patterns.

Interestingly, despite the clear origin of the yeast gene
family by gene duplication, the ARE proteins have
diverged such that the majority of sequence conservation
is in the COOH-terminal domain of the protein. This is
presumably the critical region of the molecule, since it
is also conserved in the human protein. Using this
region as a database probe, R07932 (11) was identified,
a partially sequenced cDNA entry in the database of
expressed sequence tags (dbEST); R07932 exhibits
significant similarity to the ACATs, particularly over the
COOH-terminal region. Taken together, the "founder"
sequence, the observations in yeast of a two-gene family
for sterol esterification, and the tissue-specific
expression patterns of enzyme activity in Acact knock-out
mice suggest that there are multiple genes for this
reaction in all eukaryotes. Reported here are the
isolation and characterization of cDNAs from two human
loci that encode ACAT Related Gene Products (ARGP).
ARGP1 is represented multiple times in the dbEST,
including R07932, and is expressed ubiquitously, with the
highest levels occurring in the liver, intestine and
adrenal gland. By contrast, sequences identical to ARGP2
in the databases are infrequent, consistent with the
observation of an essentially embryonic pattern of
expression. Analysis of genome databases indicates that
gene families that conserve these motifs are a common
occurrence in multiple organisms.

Materials and Methods

Database searching for ACAT related sequences. A
sequence corresponding to the strongest region of protein
conservation between the human macrophage ACAT and yeast
ARE sequences was used to identify protein sequences
predicted to be encoded by entries in the dbEST using the
tblastn software (NCBI). The DNA sequences thus arising
were used to detect additional clones, in any available
database, that demonstrated overlaps of nucleotide
sequence identity. Databases searched included dbEST,
the non-redundant GENBANK, and the confidential database
held at The Institute for Genomic Research (TIGR).
Overlaps between these sequences were detected using the
sequence alignment programs "lineup" and "pileup" from GCG
Inc (Madison, WI). A consensus sequence was then
generated. Escherichia coli clones with the largest
inserts corresponding to these sequences (see table 1)
were obtained from the I.M.A.G.E. consortium and
resequenced from both ends using commercial primers, T3
and T7, or internal primers derived from a consensus.
Nucleotide sequencing was performed at the Columbia
University Combined Center core facility using an Applied
Biosystems fluorescent sequencing machine.
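The overlap detection step described above can be illustrated with a small sketch. The following Python fragment is not the GCG "lineup" or "pileup" software; it is a minimal illustration, under the assumption of error-free reads, of how two nucleotide reads sharing an exact suffix-prefix overlap can be merged into one longer sequence. The function names, the minimum overlap length and the toy reads are hypothetical.

```python
from typing import Optional


def longest_overlap(a: str, b: str, min_len: int = 5) -> int:
    """Return the length of the longest suffix of `a` that exactly
    matches a prefix of `b`, or 0 if it is shorter than `min_len`."""
    for n in range(min(len(a), len(b)), min_len - 1, -1):
        if a[-n:] == b[:n]:
            return n
    return 0


def merge_reads(a: str, b: str, min_len: int = 5) -> Optional[str]:
    """Merge two reads on their exact suffix-prefix overlap.
    Returns None when no sufficiently long overlap exists."""
    n = longest_overlap(a, b, min_len)
    return a + b[n:] if n else None


if __name__ == "__main__":
    # Toy reads (hypothetical, chosen only to show an 8-base overlap).
    read_1 = "GGCTGAAAGCCACC"
    read_2 = "AAGCCACCCCTTCT"
    print(longest_overlap(read_1, read_2))
    print(merge_reads(read_1, read_2))
```

Real EST reads contain sequencing errors, so practical assembly tolerates mismatches and considers both strands; the exact-match rule here is a simplification.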



Table 1

Entries of human ACAT related gene products in the
database of expressed sequence tags.

Gene: ARGP1; ARGP2

Clone ID (IMAGE): 200587; 55218; 1881180; 78614; 153836;
106260; 128921; 213176; 245265

GENBANK ID: R99213; R99214; c-IMFll; Z43867; Z33993;
H45923; H45924; M79086; R48474; R48475; T35085; R10272;
R10273; N75438; H76642

Insert size (bp): 620; 1800; 1000; 300; 800; 800; 680;
540; 300

Comments: chimera



Isolation and sequencing analysis of full length cDNA
clones of ARGP1 and ARGP2. Since in no instance were any
of the database clones full length for either ARGP1 or
ARGP2, additional clones with intact 5'-ends are
described. Several strategies were chosen. Using a
consensus nucleotide sequence derived from the sequencing
of the dbEST clones, 3'-end, gene specific primers were
designed and synthesized and used in a PCR based, rapid
amplification of cDNA ends (RACE) to derive 5'-RACE
reaction products from a human liver/spleen Marathon
library (Clontech). A similar strategy was used to derive
PCR products from a human fetal brain library generously
provided by Bento Soares (Columbia University). In some
instances, a nested PCR reaction was performed using
internal gene specific primers and library adaptors.
Finally, primer extension cDNA products were identified
from mRNA extracted from human intestine (a kind gift of
P. Dawson). Amplification products of the predicted size
were confirmed as gene specific, using Southern
hybridization to sequences predicted to be at the 3'-end
of these products. The products were isolated from
agarose gels using Geneclean and subcloned into TA
variants of pBluescript (Stratagene) vectors or
Klenow/kinase treated and blunt-end ligated to pGEM2
(Promega). Positive clones were identified by colony
hybridization or by PCR amplifications using an internal
ARGP specific primer. Clones with the largest inserts
were sequenced to obtain novel sequence and, where
necessary, this process was reiterated with ARGP 5'
specific primers derived from the new sequence.

Tissue specific expression of hACAT and ARGPs. Fragments
of the dbEST clones R99213 and R10273, corresponding to
ARGP1 and ARGP2, respectively, were derived by digestion
with EcoRI and NotI and purified from agarose gels with
Geneclean. A 1.6 kbp fragment corresponding to the human
ACAT cDNA identified by Chang et al. was used as a probe
for the expression of this gene. Radiolabelled probes
were generated by random priming (Pharmacia) in the
presence of 32P-dCTP and used to probe Multiple Tissue
Northerns (MTN, Clontech) of human samples.
Hybridizations were performed, according to the
manufacturer's instructions, using ExpressHyb rapid
hybridization solution for 1 hour at 78°C, followed by
washes in 2xSSC at 55°C and 0.1xSSC, 0.5% SDS at 50°C.

Cell culture expression of ARGPs. To facilitate
quantitation of mRNA from the ARGP genes, a
reverse-transcriptase PCR (RT-PCR) approach was devised to
analyze expression in a variety of human (HepG2, THP-1
macrophages), rodent (J774 macrophages) and simian
(CV-1 kidney cells) cell lines. Where possible, primers
were designed to be conserved between rodents and humans
(as described below, the mouse sequence homolog to ARGP1
has been identified). Alternatively, PCR conditions were
optimized to permit moderate mismatches. The ARGP
amplification primers were designed to be gene specific
(i.e. to regions not conserved within the family) and to
produce distinct size products.

Experimental Results and Discussion 

The approach taken was to assume that the region of
strongest conservation between the yeast ARE proteins and
hACAT would be critical to the function of any sterol
esterification enzyme. A region of conservation
(consensus; LN E FGDR - FY GDWWN, single letter amino-acid
code) that is invariant over the three proteins was chosen
and a series of entries derived from gene sequencing
projects identified. In addition to sequences from
Caenorhabditis elegans, Schizosaccharomyces pombe,
Drosophila melanogaster and Arabidopsis thaliana, several
entries in the dbEST of human cDNAs that suggested an
independent gene encoding an ACAT like protein were
observed. Using the nucleotide sequence to this clone,
a second homologous but distinct entry was identified.
These proteins are termed ACAT Related Gene Products
(ARGP) 1 (acylcoenzyme A: cholesterol acyltransferase II)
and 2 (acylcoenzyme A: cholesterol acyltransferase III).
The sequence identified by Chang et al. (8) will be
referred to as hACAT hereon. A limited protein sequence
of a founder clone (R07932) for ARGP1 has been presented
previously (11). The entries in the dbEST that define
these two genes, including their insert sizes, are
described in table 1. As is evident, the majority of
inserts (with the exception of a chimeric clone, Z43867)
are less than 1 kbp. The northern and sequence analysis
presented indicated them to be incomplete clones.
However, they clearly define two distinct genes of strong
similarity to the ACAT sequence, with the majority of
predicted protein conservation at the COOH-terminal
region. As described below, certain motifs considered
critical to sterol esterification are conserved. To
identify the role of these genes in the reaction, full
length ARGP clones were sought and their expression
patterns identified.

ARGP1, a ubiquitously expressed member of the ACAT gene
family. To establish the profile of expression of ARGP1,
multiple tissue northerns of human mRNA were probed,
using a fragment close to the 3' end of the gene.
Although this region displays the maximum conservation at
the protein level in this gene family, the genes are
sufficiently divergent at the DNA level to be able to
design gene specific hybridization probes. The ARGP1
sequence is expressed at abundant levels in many tissues
with the exception of lung and kidney. The majority of
tissues express a 2.0kb message, but some tissues (e.g.
adrenal, small intestine, thymus) also express a 2.4kb
mRNA at varying levels. The same northerns were
hybridized with a probe to the human macrophage ACAT
sequence. As described by others (8, 12, 13), the hACAT
sequence detects 4 messages of approximately 3.0, 4.0,
4.7 and 7.4kb. Upon comparison of the two hybridization
results, an overlapping but occasionally differential
expression pattern was observed. Adrenal tissues express
the highest levels of both hACAT and ARGP1 message. By
this analysis, hACAT messages are rare in liver and
intestine, in contrast to ARGP1, which is highly expressed
in these tissues. Conversely, ARGP1 was poorly expressed
in kidney, lung and placenta, although hACAT mRNA was
easily detected. This tissue specific expression
suggests that ARGP1 is an ideal candidate for sterol
esterification in tissues such as liver and intestine,
which retain sterol esterification activity in ACAT
knock-out mice (10).



ARGP2, an embryonic isoform of the ACAT gene family.
Efforts to identify a transcript from ARGP2 in adult
tissues were unsuccessful. Therefore embryonic tissue
samples were chosen for investigation, since the original
founder clone was derived from a fetal liver library. A
multiple tissue northern of mRNA from human embryonic
brain, liver, kidney, and lung was probed with an
ARGP2 specific, COOH-terminal probe. As shown in figure
9, a single message of ~2.2kb was identified only in
embryonic liver tissues, suggesting a high degree of
tissue and developmental specificity to the expression of
this gene product.

Expression of ARGP1 in cell culture models. To develop
a system in which to test the effect of reaction
substrates on the esterification reaction performed by
the ARGP enzymes, the expression of these genes was
examined in several tissue specific cell culture
models. As shown in figure 10, ARGP1 is clearly
expressed in liver (HepG2) and kidney (CV-1) cell lines.
The latter result is somewhat in contrast to the northern
blot on human tissue samples. This most likely reflects
the sensitivity of the RT-PCR approach compared to filter
hybridization and suggests that ARGP1 is probably
expressed in most tissues. Alternatively, it may
represent a species difference (simian vs. human) or, more
interestingly, the differentiation status of the cells
under study. In data not shown here, ARGP1 was also
clearly expressed in human and mouse macrophage models
(THP-1 and J774 cells).






Sequence characteristics of ARGP1 and ARGP2. By a
combination of 5'-RACE and primer extension, additional
sequence for the cDNAs of ARGP1 and ARGP2 (Figs. 11 and 12)
has been identified. The ARGP1 sequence predicts a 407
amino-acid protein with approximately 27% identity and
52% similarity to the hACAT protein (Fig. 4).
Interestingly, as was observed for the yeast ARE
proteins, the strongest conservation exists at the
COOH-terminus of the molecules, to the extent that the
NH2-terminal 50% of all these proteins is essentially
unrelated sequence. This pattern also persists at the
DNA level (not shown). Identification of the genomic
sequence to these cDNAs will establish whether this
remarkable divergence arises by exon shuffling of common
sequences. Alternatively, convergent evolution of
domains with conserved functions in sterol esterification
or related processes may have resulted in the generation
of these families. Since the level of DNA conservation
between ARGP1 and hACAT is quite low (37% identity), the
latter possibility seems likely. The conserved regions
are discussed in the context of multiple ACAT like
sequences below. The ARGP1 sequence predicts a protein
of approximately 47kDa with multiple transmembrane
domains in similar positions to those predicted in hACAT.
This strongly suggests a membrane location for ARGP1, as
would be predicted for a sterol esterification enzyme.
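As a rough cross-check of the relationship between protein length and predicted mass, the sketch below applies a common rule of thumb of about 110 Da per amino-acid residue. This average is an assumption introduced here for illustration and is not taken from the specification; a composition-based calculation gives a somewhat different value, so the result should only be read as landing in the same general range as the approximately 47kDa figure quoted above.

```python
# Rule-of-thumb average residue mass in daltons. This value is an
# assumption for illustration, not a figure from the specification.
AVERAGE_RESIDUE_MASS_DA = 110.0


def estimate_mass_kda(n_residues: int,
                      avg_residue_da: float = AVERAGE_RESIDUE_MASS_DA) -> float:
    """Estimate protein mass in kDa from its length in residues."""
    if n_residues <= 0:
        raise ValueError("n_residues must be positive")
    return n_residues * avg_residue_da / 1000.0


if __name__ == "__main__":
    # 407 residues is the predicted length of ARGP1 given in the text.
    print(f"{estimate_mass_kda(407):.1f} kDa")
```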

ARGP2 displays a significantly higher level of amino acid
conservation with hACAT than does ARGP1. Over the
sequence shown (Fig. 12), the protein is 59% identical
and 79% similar to human ACAT. Over the same region
ARGP1 is only conserved at the level of 32% identity.
This striking identity is maintained at the DNA level
(62% identity) and may suggest that ARGP2 is more closely
analogous to hACAT, in both its mechanism of action and
its origin, than is ARGP1. As for ARGP1, certain
hallmark sequences are retained in ARGP2 (see below).
The ARGP2 predicted protein also possesses several
predicted transmembrane domains. One entry to the dbEST
for ARGP2 has also been allocated an STS (sequence tagged
site) at the Whitehead Institute (entry # WI-11660) and
has thus been mapped to human chromosome 12.
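Percent identity and percent similarity figures of the kind quoted above are computed over aligned residues. The sketch below shows the arithmetic on two ungapped 21-residue segments transcribed from the sequence listing of this application (SEQ ID NO: 2, residues 389-409, and the corresponding region encoded by SEQ ID NO: 3). The grouping of "similar" residues is a simplified assumption made for this illustration; it is not the scoring scheme used to obtain the values reported in the text, and the function names are hypothetical.

```python
# Two pre-aligned, ungapped segments transcribed from the sequence listing.
SEGMENT_A = "LNAFAEMLRFGDRMFYKDWWN"  # SEQ ID NO: 2, residues 389-409
SEGMENT_B = "LNCVAELTRFADRYFYGDWWN"  # corresponding region encoded by SEQ ID NO: 3

# Simplified conservative-substitution groups (an assumption for this
# sketch; real analyses typically use a substitution matrix).
SIMILAR_GROUPS = ("AVLIMC", "FYW", "KRH", "DE", "STNQ", "GP")


def _check(a: str, b: str) -> None:
    if len(a) != len(b) or not a:
        raise ValueError("segments must be non-empty and of equal length")


def percent_identity(a: str, b: str) -> float:
    """Percentage of aligned positions carrying the same residue."""
    _check(a, b)
    return 100.0 * sum(x == y for x, y in zip(a, b)) / len(a)


def percent_similarity(a: str, b: str) -> float:
    """Percentage of aligned positions that are identical or fall in
    the same group of SIMILAR_GROUPS."""
    _check(a, b)

    def similar(x: str, y: str) -> bool:
        return x == y or any(x in g and y in g for g in SIMILAR_GROUPS)

    return 100.0 * sum(similar(x, y) for x, y in zip(a, b)) / len(a)


if __name__ == "__main__":
    print(f"identity:   {percent_identity(SEGMENT_A, SEGMENT_B):.1f}%")
    print(f"similarity: {percent_similarity(SEGMENT_A, SEGMENT_B):.1f}%")
```

Because this segment lies in the most conserved region, its identity is much higher than the whole-protein figures given in the text.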

Sterol esterification enzymes evolve as gene families in
multiple organisms. Using the hACAT and ARGP nucleotide
sequences as probes of multiple databases, we sought to
establish whether the observation of gene families of
ACAT related enzymes in yeast and humans was a common
occurrence in other organisms. In general this is the
case (Fig. 13). Sequences from the genomes of C. elegans,
D. melanogaster and S. pombe have been identified that
are distinct from each other within an organism and
exhibit approximately 25% identity at the predicted
protein level. As for all the ACAT-like proteins, the
maximum conservation is observed at the COOH-terminal
region, with many of the apparently critical motifs
described below being maintained. As would be
anticipated, the mouse cDNA for ARGP1 exhibits
approximately 85% identity with its human homolog.

Sequence conservation between ARGPs and ACAT in multiple
organisms. As described above, these sequences are
ubiquitous. This conservation, across and within
organisms, facilitates the identification of critical
domains of esterification enzymes (Fig. 14).
Interestingly, there is no sequence similarity between
any ACAT-like molecule and lecithin cholesterol
acyltransferase (LCAT), despite the shared utilization of
cholesterol. For the hACAT sequence and its murine
homologs, a similarity to "signature" motifs of enzymes
involved in acyl adenylation reactions was reported (8,
12). However, these sequences are unlikely to be
critical, since they are not conserved in any homolog
from any other organism. By contrast, there are regions
of strong conservation between these molecules which may
be critical to function. In the esterification
defective SRD4 mutant CHO cell line, the expressed but
defective ACAT allele encodes a single amino-acid
substitution, of leucine for serine 265, which lies in a
conserved domain of human, rodent and yeast ACAT.
Interestingly, this motif in ARGP1 is more degenerate:
although the serine is conserved, the flanking sequence is
conservatively replaced by similar residues. The ACAT
reaction is probably mediated by a multimeric complex, as
shown by radiation inactivation experiments (15).
Accordingly, the yeast and human sequences all possess
"leucine zipper" multimerization motifs. ARGP1 and ARGP2
lack a classical multimerization motif. Although protein
phosphorylation as a mode of ACAT regulation has been
refuted (16), a very strong region of conservation
(consensus over 7 sequences; LN E FGDR - F YGDWWN, single
letter amino-acid code) predicts a tyrosine kinase
consensus motif for phosphorylation. ARGP2 and ARGP1 are
no exception to this. In particular, the aspartic
acid-tryptophan-tryptophan-asparagine (DWWN) sequence
appears to be invariant (with the exception of S. pombe,
where it is AWWN) and may represent an active site for the
esterification reaction. These regions of conservation
are targets for mutagenesis and in preliminary
experiments appear critical to the activity of the ACAT
and ARE enzymes (17).
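The DWWN motif discussed above can be located programmatically in a translated sequence. The following sketch translates a short in-frame fragment transcribed from SEQ ID NO: 3 of the sequence listing (the row ending at position 2160, with the first two and the last base trimmed to keep the reading frame) using the standard genetic code, and then scans the protein for DWWN or the AWWN variant noted for S. pombe. The function names and the regular expression are illustrative assumptions, not part of the methods used in this application.

```python
import re

# Standard genetic code, built from the conventional TCAG ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

# In-frame fragment transcribed from SEQ ID NO: 3 (row ending at 2160).
FRAGMENT = "GCGGAGTTGACCAGGTTTGCGGACAGATATTTCTACGGCGACTGGTGGAATTGCGTT"

# DWWN, or AWWN as described for S. pombe.
MOTIF = re.compile(r"[DA]WWN")


def translate(dna: str) -> str:
    """Translate an in-frame DNA string; trailing partial codons are ignored."""
    dna = dna.upper()
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


def find_motif(protein: str):
    """Return (zero-based start, matched text) for every motif occurrence."""
    return [(m.start(), m.group()) for m in MOTIF.finditer(protein)]


if __name__ == "__main__":
    protein = translate(FRAGMENT)
    print(protein)
    print(find_motif(protein))
```

The same `find_motif` call can be applied to any single-letter protein string, for example a segment of SEQ ID NO: 2 converted from three-letter code.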

Why ACAT gene families? The role, if any, of these ACAT
sequence homologs in sterol homeostasis is unclear.
Since mouse macrophage ACAT is not critical to sterol
esterification in the liver and intestine, it is possible
that the additional enzymes evolved to recognize
alternate substrates and thus promote sterol absorption
in the intestine or production of lipoproteins by the
liver (39). Future experiments will be directed to
complete the molecular characterization of these genes
and test these hypotheses.
SEQUENCE LISTING

(1) GENERAL INFORMATION:

(i) APPLICANT: Stephen L. Sturley

(ii) TITLE OF INVENTION: DNA ENCODING ACYLCOENZYME A: CHOLESTEROL
ACYLTRANSFERASE II AND USES THEREOF

(iii) NUMBER OF SEQUENCES: 19

(iv) CORRESPONDENCE ADDRESS:
(A) ADDRESSEE:
(B) STREET: 1185 Avenue of the Americas
(C) CITY: New York
(D) STATE: New York
(E) COUNTRY: U.S.A.
(F) ZIP: 10036

(v) COMPUTER READABLE FORM:
(A) MEDIUM TYPE: Floppy disk
(B) COMPUTER: IBM PC compatible
(C) OPERATING SYSTEM: PC-DOS/MS-DOS
(D) SOFTWARE: PatentIn Release #1.0, Version #1.30

(vi) CURRENT APPLICATION DATA:
(A) APPLICATION NUMBER: Not Yet Known
(B) FILING DATE: Herewith
(C) CLASSIFICATION:

(viii) ATTORNEY/AGENT INFORMATION:
(A) NAME: John P. White
(B) REGISTRATION NUMBER: 28,678
(C) REFERENCE/DOCKET NUMBER: 0575/50852-A

(ix) TELECOMMUNICATION INFORMATION:
(A) TELEPHONE: (212) 278-0400
(B) TELEFAX: (212) 391-0525
(C) TELEX:

(2) INFORMATION FOR SEQ ID NO:1:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 3649 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1:
GGGTAGAGAC GGGGTTTCAC CGTGTTAGCC AGGATGGTCT GGATCTCCTG ACCTCGTGAT 60 
CCACCCACCT CGGCCTCCTA AAGTGCTGGG ATTACAGACA TGAGCCACCG CGCCCAGCCC 120 
TATTCATCCC TTTTCAAAAG TCAGACCCTA GGAAGCTGGA GGGAGGTGGG GCATGGTTTT 180

ACAGTGAATT TCTGATTTCA CTCAGGGTGA TAAATCAGAC TCTTGGGGAA GCGGGTGGTG 240

GCTCTGGACA GCAGCAGGAA TGGGGATCCA GTTAGCAACA AATCCATGGA CCTATGACAG 300 

GCTGAAAGCC ACCCCTTCTC CATCTTTGGG AGGTTGCCAA TGTCTGATTT AACACTATCC 360 

AATGAATGAT CATTGAAAGT AAAAAATAAC TATCAACTAG CAGAAAATAT AAATGGTAAG 420 

CATTAGCACA TATTTCACAT GTTTATATTT GGCTCTCAGA TTGACCTATA AAACAAAGTC 480 

TGGGAAATTC TATATGATCC TGAAAAAATG ATACGCTGGT CTGGATGGTA GAATAAGTTG 540

GAGAAATGTT TAAGCCAAAA TGCAGTCTTA CCAATGACTT TTTATTTTAT TTTATTAATT 600 

TTCAGGATTT TTGGTATACA GGTGGTTTTT GGTTACATGG AAAAGTTCTT TACTGGTGAT 660 

TTCTGAGATT TTAGTTCACC CCTTATCCTG AGCAGTGTAC ACTGTTCCCA ATATGTAGCC 720 

TTTTATCCCT CACCCCCTCT AAGTTCAAGA AGACTATGGT CCTGCAGAAA GCTTTATATG 780

TAATTAACAT ATCTTTATCT TTATCTTTAT AGGCAGTAGA CTCATCTTTT GAAACAGATT 840 

CCATTAAGAG TGAATGTGTA CCCTCCCTCT AGCCTTTATT ATTACTGTTT TTGCTATTAC 900

ATGTGTTAGT GTATGTGAAT TTAATGCTTA AAAATGTATC CCATTGGCTA CTATGGCAAA 960 

AGGTTGACTC ATAAGAGTTT AGCACGGGTT AAGATCTGAA AGTTTTCTCC CAGCCTCTTA 1020 

TCACTGGCGC AGACTTCACA ATTCATGGAA GCCACCAGTG AGATGACATT GCCTCAGGCA 1080

GTTACTATTT TTATATTCTA TAACTCGAGG AGCTCAGGGT TTCGGAAATC ATTAAACTTT 1140

TTTTGTCCTT TTAAAGTTGG AGACAGCAAT TGTAGACAGC CTTCCAGTGG GTTATCTTTT 1200 

TGTGTCTCCT TACCTGTGGA GAAGCCTATT AGCTGGGATA TGTAGTTAAA TAGCTATATT 1260 

TATATATATC CAGGGCACCC CGAATTCGGG AGAGCTTCCC GGAGTCGACC TTCCTGCTGG 1320 

CTGCTCTGTG ACCGCTTCCC GGCTCTGCCC TCTTGGCCGA AGTGCCCGCT GCCGGGCGCG 1380 

GGCCTCAGAC AATACAATGG TGGGTGAAGA GAAGATGTCT CTAAGAAACC GGCTGTCAAA 1440 

GTCCAGGGAA AATCCTGAGG AAGATGAAGA CCAGAGAAAC CCTGCAAAGG AGTCCCTAGA 1500 

GACACCTAGT AATGGTCGAA TTGACATAAA ACAGTTGATA GCAAAGAAGA TAAAGTTGAC 1560 

AGCAGAGGCA GAGGAATTGA AGCCATTTTT TATGAAGGAA GTTGGCAGTC ACTTTGATGA 1620

TTTTGTGACC AATCTCATTG AAAAGTCAGC ATCATTAGAT AATGGTGGGT GCGCTCTCAC 1680 

AACCTTTTCT GTTCTTGAAG GAGAGAAAAA CAACCATAGA GCGAAGGATT TGAGAGCACC 1740 

TCCAGAACAA GGAAAGATTT TTATTGCAAG GCGCTCTCTC TTAGATGAAC TGCTTGAAGT 1800 

GGACCACATC AGAACAATAT ATCACATGTT TATTGCCCTC CTCATTCTCT TTATCCTCAG 1860 

CACACTTGTA GTAGATTACA TTGATGAAGG AAGGCTGGTG CTTGAGTTCA GCCTCCTGTC 1920 

TTATGCTTTT GGCAAATTTC CTACCGTTGT TTGGACCTGG TGGATCATGT TCCTGTCTAC 1980 

ATTTTCAGTT CCCTATTTTC TGTTTCAACA TTGGCGCACT GGCTATAGCA AGAGTTCTCA 2040 

[illegible] CGTTCTCTCT TCCATGGCTT TCTTTTCATG ATCTTCCAGA TTGGAGTTCT 2100

AGGTTTTGGA CCAACATATG TTGTGTTAGC [illegible] [illegible] CCCGGTTCAT 2160

CAT rATATTC GAGCAGATTC GTTTTGTAAT [illegible] [illegible] GAGAGAACGT 2220

GCCTCGGGTA CTAAATTCAG CTAAGGAGAA [illegible] oil CCAATAC CTACAGTCAA 2280

CCAGTATTTG TACTTCTTAT TTGCTCCTAC [illegible] [illegible] ATCCCAGGAA 2340

TCCCACTGTA AGATGGGGTT ATGTCGCTAT GAAGTTTGCA CAGGTCTTTG GTTGCTTTTT 2400

CTATGTGTAC TACATCTTTG AAAGGCTTTG TGCCCCCTTG TTTCGGAATA TCAAACAGGA 2460

GCCCTTCAGC GCTCGTGTTC TGGTCCTATG TGGTATTTAA CTCCATCTTG CCAGGTGTGC 2520

TGATTCTCTT CCTTACTTTT TTTGCCTTTT TGCACTGCTG GCTCAATGCC TTTGCTGAGA 2580

TGTTACGCTT TGGTGACAGG ATGTTCTATA AGGATTGGTG GAACTCCACG TCATACTCCA 2640

ACTATTATAG AACCTGGAAT GTGGTGGTCC ATGACTGGCT ATATTACTAT GCTTACAAGG 2700

ACTTTCTCTG GTTTTTCTCC AAGAGATTCA AATCTGCTGC CATGTTAGCT GTCTTTGCTG 2760

TATCTGCTGT AGTACACGAA TATGCCTTGG CTGTTTGCTT GAGCTTTTTC TATCCCGTGC 2820

TGTTCGTGCT CTTCATGTTC TTTGGAATGG CTTTCAACTT CATTGTCAAT GATAGTCGGA 2880

AAAAGCCGAT TTGGAATGTT CTGATGTGGA CTTCTCTTTT CTTGGGCAAT GGAGTCTTAC 2940

TCTGCTTTTA TTCTCAAGAA TGGTATGCAC GTCGGCACTG TCCTCTGAAA AATCCCACAT 3000

TTTTGGATTA TGTCCGGCCA CGTTCCTGGA CTTGTCGTTA CGTGTTTTAG AAGCTTGGAC 3060

TTTGTTTCCT CCTTGTCACT GAAGATTGGG J.AGCTCCCTG ATTTGGAGCC AGCTGTTTCC 3120

AGTTGTTACT GAAGTTATCT GTGTTATTTG [illegible] GGCTTTACAG ATGACTCACT 3180

CCATTCCTAG GTCACTTGAA GCCAAACTGT [illegible] CTGGAGTCTT GTACACTTAA 3240

GCAGAGCAGA ACTTTTTTTG TGGGGCTGGG TGGGGGGAGA AGACCGACTA ACAGCTGAAG 3300

TAATGACAGA TTGTTGCTGG GTCATATCAG CTTTATCCCT TGGTAATTAT ATCTGTTTTG 3360

TTTCTTGACT CTGTCCAATC AGAGAATAAA CATCATAGTT TCTTGGCCAC TGAATTAGCC 3420

AAAACACTTA GGAAGAAATC ACTTAAATAC CTCTGGCTTA GAAATTTTTT CATGCACACT 3480

GTTGGAATGT ATGCTAATTG AACATGCAAT TGGGGAAGAA AAAATGTAGA ATGATTTTTG 3540

CTATTTCTAG TAGAAAGAAA ATGTCTGTTT TCCAAAGATA ATGTTATACA TCCTATTTTG 3600

TAATTTTTTT GAAAAAAGTT CAATGTTCAG TTTTCCTTAGT TTTTACCTT 3660


(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 550 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: Amino Acid



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

Met Val Gly Glu Glu Lys Met Ser Leu Arg Asn Arg Leu Ser Lys Ser
1 5 10 15

Arg Glu Asn Pro Glu Glu Asp Glu Asp Gln Arg Asn Pro Ala Lys Glu
20 25 30

Ser Leu Glu Thr Pro Ser Asn Gly Arg Ile Asp Ile Lys Gln Leu Ile
35 40 45

Ala Lys Lys Ile Lys Leu Thr Ala Glu Ala Glu Glu Leu Lys Pro Phe
50 55 60

Phe Met Lys Glu Val Gly Ser His Phe Asp Asp Phe Val Thr Asn Leu
65 70 75 80

Ile Glu Lys Ser Ala Ser Leu Asp Asn Gly Gly Cys Ala Leu Thr Thr
85 90 95

Phe Ser Val Leu Glu Gly Glu Lys Asn Asn His Arg Ala Lys Asp Leu
100 105 110

Arg Ala Pro Pro Glu Gln Gly Lys Ile Phe Ile Ala Arg Arg Ser Leu
115 120 125

Leu Asp Glu Leu Leu Glu Val Asp His Ile Arg Thr Ile Tyr His Met
130 135 140

Phe Ile Ala Leu Leu Ile Leu Phe Ile Leu Ser Thr Leu Val Val Asp
145 150 155 160

Tyr Ile Asp Glu Gly Arg Leu Val Leu Glu Phe Ser Leu Leu Ser Tyr
165 170 175

Ala Phe Gly Lys Phe Pro Thr Val Val Trp Thr Trp Trp Ile Met Phe
180 185 190

Leu Ser Thr Phe Ser Val Pro Tyr Phe Leu Phe Gln His Trp Arg Thr
195 200 205

Gly Tyr Ser Lys Ser Ser His Pro Leu Ile Arg Ser Leu Phe His Gly
210 215 220

Phe Leu Phe Met Ile Phe Gln Ile Gly Val Leu Gly Phe Gly Pro Thr
225 230 235 240

Tyr Val Val Leu Ala Tyr Thr Leu Pro Pro Ala Ser Arg Phe Ile Ile
245 250 255

Ile Phe Glu Gln Ile Arg Phe Val Met Lys Ala His Ser Phe Val Arg
260 265 270

Glu Asn Val Pro Arg Val Leu Asn Ser Ala Lys Glu Lys Ser Ser Thr
275 280 285

Val Pro Ile Pro Thr Val Asn Gln Tyr Leu Tyr Phe Leu Phe Ala Pro
290 295 300

Thr Leu Ile Tyr Arg Asp Ser Tyr Pro Arg Asn Pro Thr Val Arg Trp
305 310 315 320

Gly Tyr Val Ala Met Lys Phe Ala Gln Val Phe Gly Cys Phe Phe Tyr
325 330 335

Val Tyr Tyr Ile Phe Glu Arg Leu Cys Ala Pro Leu Phe Arg Asn Ile
340 345 350

Lys Gln Glu Pro Phe Ser Ala Arg Val Leu Val Leu Cys Val Phe Asn
355 360 365

Ser Ile Leu Pro Gly Val Leu Ile Leu Phe Leu Thr Phe Phe Ala Phe
370 375 380

Leu His Cys Trp Leu Asn Ala Phe Ala Glu Met Leu Arg Phe Gly Asp
385 390 395 400

Arg Met Phe Tyr Lys Asp Trp Trp Asn Ser Thr Ser Tyr Ser Asn Tyr
405 410 415

Tyr Arg Thr Trp Asn Val Val Val His Asp Trp Leu Tyr Tyr Tyr Ala
420 425 430

Tyr Lys Asp Phe Leu Trp Phe Phe Ser Lys Arg Phe Lys Ser Ala Ala
435 440 445

Met Leu Ala Val Phe Ala Val Ser Ala Val Val His Glu Tyr Ala Leu
450 455 460

Ala Val Cys Leu Ser Phe Phe Tyr Pro Val Leu Phe Val Leu Phe Met
465 470 475 480

Phe Phe Gly Met Ala Phe Asn Phe Ile Val Asn Asp Ser Arg Lys Lys
485 490 495

Pro Ile Trp Asn Val Leu Met Trp Thr Ser Leu Phe Leu Gly Asn Gly
500 505 510

Val Leu Leu Cys Phe Tyr Ser Gln Glu Trp Tyr Ala Arg Arg His Cys
515 520 525

Pro Leu Lys Asn Pro Thr Phe Leu Asp Tyr Val Arg Pro Arg Ser Trp
530 535 540

Thr Cys Arg Tyr Val Phe
545 550



(2) INFORMATION FOR SEQ ID NO: 3:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 2601 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3:
GCCCTCCAGC TCTCTACTAA GACCGGTCGC AAGCATGCTG GGCGATATAT CCAAACCACA 60

CCACACATGG TCTCCCTCCT GCGTCAAAAT CTCCCCAGAC AGTCCGGACC CGCACCCGAT 120

ATCCAGAATG AAACTGCACG GCTGCAGATT CAAAAGCTCC AACGCCCTCA GCGTCATCTT 180 

CGCCTGGATA TGCTGCACTC TGGTCGAACC CGTGTACTTG TGTGCTTCGC TATCATTATA 240 

GAAAATCTCC GGTGGTGCCA ACTCCTCAGG ACGTGACATT ATTTCTTCTC TGATATATTT 300 

CCTGTGTTTC CGTACCGCAC CTTTTTAGCA CTACTTTTTT ACTATGCTCT TCTTCTTCTG 360 

CTTCTTCTGC TTTTTTCCTC TTTATCACAC TATGTATGTG CTGCTCATCT CTTCTTTTTA 420 

TCGATAAAAT TGAAAAATGT GAGATGGTGT AGAGTGAAAA AAAAAAAAAA ATCTGGCTTG 480 

GCCATCAAAT ACCCGGCCGT GGTTGGACTC GTTTAGCGAA CAATAGCACC CAGCAGACCC 540

TGGCAACATG CGGATGATAT AAGAAGGACG AGCGTGGTGG AGGAAAGGGG CGCCATTGGC 600 

ACACTCACGC AGGTGGTTGT TCAGCACGGC TTGCAGCAAG AGCGCCAAAA CAGATTGCAA 660

GAATGACGGA GACTAAGGAT TTGTTGCAAG ACGAAGAGTT TCTTAAGATC CGCAGACTCA 720

ATTCCGCAGA AGCCAACAAA CGGCATTCGG TCACGTACGA TAACGTGATC CTGCCACAGG 780

AGTCCATGGA GGTTTCGCCA CGGTCGTCTA CCACGTCGCT GGTGGAGCCA GTGGAGTCGA 840

CTGAAGGAGT GGAGTCGACT GAGGCGGAAC GTGTGGCAGG GAAGCAGGAG CAGGAGGAGG 900

AGTACCCTGT GGACGCCCAC ATGCAAAAGT ACCTTTCACA CCTGAAGAGC AAGTCTCGGT 960

CGAGGTTCCA CCGAAAGGAT GCTAGCAAGT ATGTGTCGTT TTTTGGGGAC GTGAGTTTTG 1020 

ATCCTCGCCC CACGCTCCTG GACAGCGCCA TCAACGTGCC CTTCCAGACG ACTTTCAAAG 1080 

GTCCGGTGCT GGAGAAACAG CTCAAAAATT TACAGTTGAC AAAGACCAAG ACCAAGGCCA 1140 

CGGTGAAGAC TACGGTGAAG ACTACGGAGA AAACGGACAA GGCAGATGCC CCCCCAGGAG 1200 

AAAAACTGGA GTCGAACTTT TCAGGGATCT ACGTGTTCGC ATGGATGTTC TTGGGCTGGA 1260 

TAGCCATCAG GTGCTGCACA GATTACTATG CGTCGTACGG CAGTGCATGG AATAAGCTGG 1320

AAATCGTGCA GTACATGACA ACGGACTTGT TCACGATCGC AATGTTGGAC TTGGCAATGT 1380 

TCCTGTGCAC TTTCTTCGTG GTTTTCGTGC ACTGGCTGGT GAAAAAGCGG ATCATCAACT 1440

GGAAGTGGAC TGGGTTCGTT GCAGTGAGCA TCTTCGAGTT GGCTTTCATC CCCGTGACGT 1500 

TCCCCATTTA CGTCTACTAC TTTGATTTCA ACTGGGTCAC GAGAATCTTC CTGTTCCTGC 1560

ACTCCGTGGT GTTTGTTATG AAGAGCCACT CGTTTGCCTT TTACAACGGG TATCTTTGGG 1620 

ACATAAAGCA GGAACTCGAG TACTCTTCCA AACAGTTGCA AAAATACAAG GAATCTTTGT 1680 

CCCCAGAGAC CCGCGAGATT CTGCAAAAAA GTTGCGACTT TTGCCTTTTC GAATTGAACT 1740 

ACCAGACCAA GGATAACGAC TTCCCCAACA ACATCAGTTG CAGCAATTTC TTCATGTTCT 1800

GTTTGTTCCC CGTCCTCGTG TACCAGATCA ACTACCCAAG AACGTCGCGC ATCAGATGGA 1860

GGTATGTGTT GGAGAAGGTG TGCGCCATCA TTGGCACCAT CTTCCTCATG ATGGTCACGG 1920 
CACAGTTCTT CATGCACCCG GTGGCCATGC GCTGTATCCA GTTCCACAAC ACGCCCACCT 1980

TCGGCGGCTG GATCCCCGCC ACGCAAGAGT GGTTCCACCT GCTCTTCGAC ATGATTCCGG 2040 

GCTTCACTGT TCTGTACATG CTCACGTTTT ACATGATATG GGACGCTTTA TTGAATTGCG 2100 

TGGCGGAGTT GACCAGGTTT GCGGACAGAT ATTTCTACGG CGACTGGTGG AATTGCGTTT 2160 

CGTTTGAAGA GTTTAGCAGA ATCTGGAACG TCCCCGTTCA CAAATTTTTA CTAAGACACG 2220 

TGTACCACAG CTCCATGGGC GCATTGCATT TGAGCAAGAG CCAAGCTACA TTATTTACTT 2280 

TTTTCTTGAG TGCCGTGTTC CACGAAATGG CCATGTTCGC CATTTTCAGA AGGGTTAGAG 2340

GATATCTGTT CATGTTCCAA CTGTCGCAGT TTGTGTGGAC TGCTTTGAGC AACACCAAGT 2400 

TTCTACCGGC AAGACCGCAG TTGTCCAACG TTGTCTTTTC GTTTGGTGTC TGTTCAGGGC 2460

CCAGTATCAT TATGACGTTG TACCTGACCT TATGAACTGC CACCATACCA CGTGTGTCCC 2520 

TCGCAAGCCC TTGATAGATA TACAATAGGG AATGGGCGTC CGTCCACCGT GGTCAAAGAC 2580 
AGGGGCAAAG AGCTCCTAGG T 
(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 610 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Amino Acid 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

Met Thr Glu Thr Lys Asp Leu Leu Gln Asp Glu Glu Phe Leu Lys Ile
1 5 10 15

Arg Arg Leu Asn Ser Ala Glu Ala Asn Lys Arg His Ser Val Thr Tyr
20 25 30

Asp Asn Val Ile Leu Pro Gln Glu Ser Met Glu Val Ser Pro Arg Ser
35 40 45

Ser Thr Thr Ser Leu Val Glu Pro Val Glu Ser Thr Glu Gly Val Glu
50 55 60

Ser Thr Glu Ala Glu Arg Val Ala Gly Lys Gln Glu Gln Glu Glu Glu
65 70 75 80

Tyr Pro Val Asp Ala His Met Gln Lys Tyr Leu Ser His Leu Lys Ser
85 90 95

Lys Ser Arg Ser Arg Phe His Arg Lys Asp Ala Ser Lys Tyr Val Ser
100 105 110

Phe Phe Gly Asp Val Ser Phe Asp Pro Arg Pro Thr Leu Leu Asp Ser
115 120 125

Ala Ile Asn Val Pro Phe Gln Thr Thr Phe Lys Gly Pro Val Leu Glu
130 135 140

Lys Gln Leu Lys Asn Leu Gln Leu Thr Lys Thr Lys Thr Lys Ala Thr
145 150 155 160

Val Lys Thr Thr Val Lys Thr Thr Glu Lys Thr Asp Lys Ala Asp Ala
165 170 175

Pro Pro Gly Glu Lys Leu Glu Ser Asn Phe Ser Gly Ile Tyr Val Phe
180 185 190

Ala Trp Met Phe Leu Gly Trp Ile Ala Ile Arg Cys Cys Thr Asp Tyr
195 200 205

Tyr Ala Ser Tyr Gly Ser Ala Trp Asn Lys Leu Glu Ile Val Gln Tyr
210 215 220

Met Thr Thr Asp Leu Phe Thr Ile Ala Met Leu Asp Leu Ala Met Phe
225 230 235 240

Leu Cys Thr Phe Phe Val Val Phe Val His Trp Leu Val Lys Lys Arg
245 250 255

Ile Ile Asn Trp Lys Trp Thr Gly Phe Val Ala Val Ser Ile Phe Glu
260 265 270

Leu Ala Phe Ile Pro Val Thr Phe Pro Ile Tyr Val Tyr Tyr Phe Asp
275 280 285

Phe Asn Trp Val Thr Arg Ile Phe Leu Phe Leu His Ser Val Val Phe
290 295 300

Val Met Lys Ser His Ser Phe Ala Phe Tyr Asn Gly Tyr Leu Trp Asp
305 310 315 320

Ile Lys Gln Glu Leu Glu Tyr Ser Ser Lys Gln Leu Gln Lys Tyr Lys
325 330 335

Glu Ser Leu Ser Pro Glu Thr Arg Glu Ile Leu Gln Lys Ser Cys Asp
340 345 350

Phe Cys Leu Phe Glu Leu Asn Tyr Gln Thr Lys Asp Asn Asp Phe Pro
355 360 365

Asn Asn Ile Ser Cys Ser Asn Phe Phe Met Phe Cys Leu Phe Pro Val
370 375 380

Leu Val Tyr Gln Ile Asn Tyr Pro Arg Thr Ser Arg Ile Arg Trp Arg
385 390 395 400

Tyr Val Leu Glu Lys Val Cys Ala Ile Ile Gly Thr Ile Phe Leu Met
405 410 415

Met Val Thr Ala Gln Phe Phe Met His Pro Val Ala Met Arg Cys Ile
420 425 430

Gln Phe His Asn Thr Pro Thr Phe Gly Gly Trp Ile Pro Ala Thr Gln
435 440 445

Glu Trp Phe His Leu Leu Phe Asp Met Ile Pro Gly Phe Thr Val Leu
450 455 460

Tyr Met Leu Thr Phe Tyr Met Ile Trp Asp Ala Leu Leu Asn Cys Val
465 470 475 480

Ala Glu Leu Thr Arg Phe Ala Asp Arg Tyr Phe Tyr Gly Asp Trp Trp
485 490 495

Asn Cys Val Ser Phe Glu Glu Phe Ser Arg Ile Trp Asn Val Pro Val
500 505 510

His Lys Phe Leu Leu Arg His Val Tyr His Ser Ser Met Gly Ala Leu
515 520 525

His Leu Ser Lys Ser Gln Ala Thr Leu Phe Thr Phe Phe Leu Ser Ala
530 535 540

Val Phe His Glu Met Ala Met Phe Ala Ile Phe Arg Arg Val Arg Gly
545 550 555 560

Tyr Leu Phe Met Phe Gln Leu Ser Gln Phe Val Trp Thr Ala Leu Ser
565 570 575

Asn Thr Lys Phe Leu Arg Ala Arg Pro Gln Leu Ser Asn Val Val Phe
580 585 590

Ser Phe Gly Val Cys Ser Gly Pro Ser Ile Ile Met Thr Leu Tyr Leu
595 600 605

Thr Leu
610

(2) INFORMATION FOR SEQ ID NO: 5:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 2421 base pairs

(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

TATAAAATTC CTTTCATCAA TACATCTATA TATTCGAATA TATAGATAAA CCAATACAAA 60 

AACATACTGA AATTTTTTGA AAACAACTAA AACTATTCAT TGCAGTTACA CGTGAATGCT 120 

AAACTTTATA TCGCTCTTGT CGGTCCCGCG GAGTTAACAT TTAACGGCTT CTCGCGCAAT 180 

AACCGGAAAA ATTCCAACAG TTTCTTTGTA ATATTATTAA GCCTTCTTTT TTCCCGGAAT 240

CTATAAGAGG GGACGAAAAT TAGCCGCTAT TAATTCTGGT ATTGCCACCT AGACAAGAAG 300

TAAACAGACA CATTACGTTA GCAAAAGCAA CAATAACAAA CACAACCATG GACAAGAAGA 360

AGGATCTACT GGAGAACGAA CAATTTCTCC GCATCCAAAA GCTCAACGCT GCCGATGCGG 420

GCAAAAGACA ATCTATAACA GTGGACGACG AGGGCGAACT ATATGGGTTA GACACCTCCG 480

GCAACTCACC AGCCAATGAA CACACAGCTA CCACAATTAC ACAGAATCAC AGCGTGGTGG 540

CCTCAAACGG AGACGTCGCA TTCATCCCAG GAACTGCTAC CGAAGGCAAT ACAGAGATTG 600

TAACTGAAGA AGTGATTGAG ACCGATGATA ACATGTTCAA GACCCATGTG AAGACTTTAA 660

GCTCCAAAGA GAAGGCACGG TATAGGCAAG GGTCCTCTAA CTTTATATCG TATTTCGATG 720

ATATGTCATT TGAACACAGG CCCAGTATAT TAGATGGGTC AGTTAACGAG CCCTTCAAGA 780

CCAAATTCGT GGGACCTACT TTAGAAAAGG AGATCAGAAG AGGGGAGAAA GAGCTAATGG 840

CCATGCGCAA AAATTTACAC CACCGCAAGT CCTCCCCAGA TGCTGTCGAC TCAGTAGGGA 900

AAAATGATGG CGCCGCCCCA ACTACTGTTC CAACTGCCGC CACCTCAGAA ACGGTGGTCA 960

CCGTTGAAAC CACCATAATT TCATCCAATT TCTCCGGGTT GTACGTGGCG TTTTGGATGG 1020

CTATTGCATT TGGTGCTGTC AAGGCTTTAA TAGACTATTA TTACCAGCAT AATGGTAGCT 1080

TCAAGGATTC GGAGATCTTG AAATTTATGA CTACGAATTT GTTCACTGTG GCATCCGTAG 1140

ATCTTTTGAT GTATTTGAGC ACTTATTTTG TCGTTGGAAT ACAATACTTA TGCAAGTGGG 1200

GGGTCTTGAA ATGGGGCACT ACCGGCTGGA TCTTCACCTC AATTTACGAG TTTTTGTTTG 1260

TTATCTTCTA CATGTATTTA ACAGAAAACA TCCTAAAACT ACACTGGCTG TCCAAGATCT 1320

TCCTTTTTTT GCATTCTTTA GTTTTATTGA TGAAAATGCA TTCTTTCGCC TTCTACAATG 1380

GCTATCTATG GGGTATAAAG GAAGAACTAC AATTTTCCAA AAGCGCTCTT GCCAAATACA 1440

AGGATTCTAT AAATGATCCA AAAGTTATTG GTGCTCTTGA GAAAAGCTGT GAGTTTTGTA 1500

GTTTTGAATT GAGCTCTCAG TCTTTAAGCG ACCAAACTCA AAAATTCCCC AACAATATCA 1560

GTGCAAAAAG CTTTTTTTGG TTCACCATGT TTCCAACCCT AATTTACCAA ATTGAATATC 1620

CAAGAACTAA GGAAATCAGA TGGAGCTACG TATTAGAAAA GATCTGCGCC ATCTTCGGTA 1680

CCATTTTCTT AATGATGATA GATGCTCAAA TCTTGATGTA TCCTGTAGCA ATGAGAGCAT 1740

TGGCTGTGCG CAATTCTGAA TGGACTGGTA TATTGGATAG ATTATTGAAA TGGGTTGGAT 1800

TGCTCGTTGA TATCGTCCCA GGGTTTATCG TGATGTACAT CTTGGACTTC TATTTGATTT 1860

GGGATGCCAT TTTGAACTGT GTGGCTGAAT TGACAAGATT TGGCGACAGA TATTTCTACG 1920

GTGACTGGTG GAATTGTGTT AGTTGGGCAG ACTTCAGTAG AATTTGGAAC ATCCCAGTGC 1980

ATAAGTTTTT GTTAAGACAT GTTTACCATA GTTCAATGAG TTCATTCAAA TTGAACAAGA 2040

GTCAAGCAAC TTTGATGACC TTTTTCTTAA GTTCCGTCGT TCATGAATTA GCAATGTACG 2100

TTATCTTCAA GAAATTGAGG TTTTACTTGT TCTTCTTCCA AATGCTGCAA ATGCCATTAG 2160

TAGCTTTAAC AAATACTAAA TTCATGAGGA ACAGAACCAT AATCGGAAAT GTTATTTTCT 2220

GGCTCGGTAT CTGCATGGGA CCAAGTGTCA TGTGTACGTT GTACTTGACA TTCTAAGGCA 2280

TCCTGCAACT GTTCTGTGGA GCTATTAAAT CTTTATAGTA AATTTTTTTT TACTTTTTTT 2340

TTTTTTTTTT TTTTTTTTTA TTATTTACAA GCGTCTATAT ATTTTCTATT ATAGAATATT 2400

GTCATTTATT ACATTGGTTC A
(2) INFORMATION FOR SEQ ID NO: 6:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 642 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Amino Acid 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

Met Asp Lys Lys Lys Asp Leu Leu Glu Asn Glu Gln Phe Leu Arg Ile
1 5 10 15

Gln Lys Leu Asn Ala Ala Asp Ala Gly Lys Arg Gln Ser Ile Thr Val
20 25 30

Asp Asp Glu Gly Glu Leu Tyr Gly Leu Asp Thr Ser Gly Asn Ser Pro
35 40 45

Ala Asn Glu His Thr Ala Thr Thr Ile Thr Gln Asn His Ser Val Val
50 55 60

Ala Ser Asn Gly Asp Val Ala Phe Ile Pro Gly Thr Ala Thr Glu Gly
65 70 75 80

Asn Thr Glu Ile Val Thr Glu Glu Val Ile Glu Thr Asp Asp Asn Met
85 90 95

Phe Lys Thr His Val Lys Thr Leu Ser Ser Lys Glu Lys Ala Arg Tyr
100 105 110

Arg Gln Gly Ser Ser Asn Phe Ile Ser Tyr Phe Asp Asp Met Ser Phe
115 120 125

Glu His Arg Pro Ser Ile Leu Asp Gly Ser Val Asn Glu Pro Phe Lys
130 135 140

Thr Lys Phe Val Gly Pro Thr Leu Glu Lys Glu Ile Arg Arg Arg Glu
145 150 155 160

Lys Glu Leu Met Ala Met Arg Lys Asn Leu His His Arg Lys Ser Ser
165 170 175

Pro Asp Ala Val Asp Ser Val Gly Lys Asn Asp Gly Ala Ala Pro Thr
180 185 190

Thr Val Pro Thr Ala Ala Thr Ser Glu Thr Val Val Thr Val Glu Thr
195 200 205

Thr Ile Ile Ser Ser Asn Phe Ser Gly Leu Tyr Val Ala Phe Trp Met
210 215 220

Ala Ile Ala Phe Gly Ala Val Lys Ala Leu Ile Asp Tyr Tyr Tyr Gln
225 230 235 240

His Asn Gly Ser Phe Lys Asp Ser Glu Ile Leu Lys Phe Met Thr Thr
245 250 255

Asn Leu Phe Thr Val Ala Ser Val Asp Leu Leu Met Tyr Leu Ser Thr
260 265 270

Tyr Phe Val Val Gly Ile Gln Tyr Leu Cys Lys Trp Gly Val Leu Lys
275 280 285

Trp Gly Thr Thr Gly Trp Ile Phe Thr Ser Ile Tyr Glu Phe Leu Phe
290 295 300

Val Ile Phe Tyr Met Tyr Leu Thr Glu Asn Ile Leu Lys Leu His Trp
305 310 315 320

Leu Ser Lys Ile Phe Leu Phe Leu His Ser Leu Val Leu Leu Met Lys
325 330 335

Met His Ser Phe Ala Phe Tyr Asn Gly Tyr Leu Trp Gly Ile Lys Glu
340 345 350

Glu Leu Gln Phe Ser Lys Ser Ala Leu Ala Lys Tyr Lys Asp Ser Ile
355 360 365

Asn Asp Pro Lys Val Ile Gly Ala Leu Glu Lys Ser Cys Glu Phe Cys
370 375 380

Ser Phe Glu Leu Ser Ser Gln Ser Leu Ser Asp Gln Thr Gln Lys Phe
385 390 395 400

Pro Asn Asn Ile Ser Ala Lys Ser Phe Phe Trp Phe Thr Met Phe Pro
405 410 415

Thr Leu Ile Tyr Gln Ile Glu Tyr Pro Arg Thr Lys Glu Ile Arg Trp
420 425 430

Ser Tyr Val Leu Glu Lys Ile Cys Ala Ile Phe Gly Thr Ile Phe Leu
435 440 445

Met Met Ile Asp Ala Gln Ile Leu Met Tyr Pro Val Ala Met Arg Ala
450 455 460

Leu Ala Val Arg Asn Ser Glu Trp Thr Gly Ile Leu Asp Arg Leu Leu
465 470 475 480

Lys Trp Val Gly Leu Leu Val Asp Ile Val Pro Gly Phe Ile Val Met
485 490 495

Tyr Ile Leu Asp Phe Tyr Leu Ile Trp Asp Ala Ile Leu Asn Cys Val
500 505 510

Ala Glu Leu Thr Arg Phe Gly Asp Arg Tyr Phe Tyr Gly Asp Trp Trp
515 520 525

Asn Cys Val Ser Trp Ala Asp Phe Ser Arg Ile Trp Asn Ile Pro Val
530 535 540

His Lys Phe Leu Leu Arg His Val Tyr His Ser Ser Met Ser Ser Phe
545 550 555 560

Lys Leu Asn Lys Ser Gln Ala Thr Leu Met Thr Phe Phe Leu Ser Ser
565 570 575

Val Val His Glu Leu Ala Met Tyr Val Ile Phe Lys Lys Leu Arg Phe
580 585 590

Tyr Leu Phe Phe Phe Gln Met Leu Gln Met Pro Leu Val Ala Leu Thr
595 600 605

Asn Thr Lys Phe Met Arg Asn Arg Thr Ile Ile Gly Asn Val Ile Phe
610 615 620

Trp Leu Gly Ile Cys Met Gly Pro Ser Val Met Cys Thr Leu Tyr Leu
625 630 635 640

Thr Phe

(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 983 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 

ATGGAGCTCA ACTTTCCCCG CTCTCCCCGC ATCCGGAAGC GCTTTCTGCT GCGACGGATC 60 

CTTGAGATGC TGTTCTTCAC CCAGCTCCAG GTGGGGCTGA TCCAGCAGTG GATGGTCCCC 120 

ACCATCCAGA ACTCCATGAA GCCCTTCAAG GACATGGACT ACTCACGCAT CATCGAGCGC 180 

CTCCTGAAGC TGGCGGTCCC CAATCACCTC ATCTGGCTCA TCTTCTTCTA CTGGCTCTTC 240 

CACTCCTGCC TGAATGCCGT GGCTGAGCTC ATGCAGTTTG GAGACCGGGA GTTCTACCGG 300

GACTGGTGGA ACTCCGAGTC TGTCACCTAC TTCTGGCAGA ACTGGAACAT CCCTGTGCAC 360

AAGTGGTGCA TCAGACACTT CTACAAGCCC ATGCTTCGAC GGGGCAGCAG CAAGTGGATG 420 

GCCAGGACAG GGGTGTTCCT GGCCTCGGCT TTCTTCCACG AGTACCTGGT GAGCGTCCCT 480 

CTGCGAATGT TCCGCCTCTG GGCTTTCACG GGCATGATGG CTCAGATCCC ACTGGCCTGG 540 

TTCGTGGGCC GCTTTTTCCA GGGCAACTAT GGCAACGCAG CTGTGTGGCT GTCGCTCATC 600 

ATCGGACAGC CAATAGCCGT CCTCATGTAC GTCCACGAAC TACTACGTGC TCAACTATGA 660

GGCCCCAGCG GCAGAGGCCT GAGCTGCACC TGAGGGCCTG GCTTCTCACT GCCACCTCAA 720 

ACCCGCTGCC AGAGCCCACC TCTCCTCCTA GGCCTCGAGT GCTGGGGATG GGCCTGGCTG 780 

CACAGCATCC TCCTCTGGTC CCAGGGAGGC CTCTCTGCCC TATGGGGCTC TGTCCTGCAC 840 

CCCTCAGGGA TGGCGACAGC AGGCCAGACA CAGTCTGATG CCAGCTGGGA GTCTTGCTGA 900 

CCCTGCCCCG GGTCCGAGGG TGTCAATAAA GTGCTGTCCA GTGAGAAAAA GAAAAAAAAA 960 
AAAAAAAAAA ATTCTGCGGC CGC 
(2) INFORMATION FOR SEQ ID NO: 8:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 219 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Amino Acid

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8:

Met Glu Leu Asn Phe Pro Arg Ser Pro Arg Ile Arg Lys Arg Phe Leu
1 5 10 15

Leu Arg Arg Ile Leu Glu Met Leu Phe Phe Thr Gln Leu Gln Val Gly
20 25 30

Leu Ile Gln Gln Trp Met Val Pro Thr Ile Gln Asn Ser Met Lys Pro
35 40 45

Phe Lys Asp Met Asp Tyr Ser Arg Ile Ile Glu Arg Leu Leu Lys Leu
50 55 60

Ala Val Pro Asn His Leu Ile Trp Leu Ile Phe Phe Tyr Trp Leu Phe
65 70 75 80

His Ser Cys Leu Asn Ala Val Ala Glu Leu Met Gln Phe Gly Asp Arg
85 90 95

Glu Phe Tyr Arg Asp Trp Trp Asn Ser Glu Ser Val Thr Tyr Phe Trp
100 105 110

Gln Asn Trp Asn Ile Pro Val His Lys Trp Cys Ile Arg His Phe Tyr
115 120 125

Lys Pro Met Leu Arg Arg Gly Ser Ser Lys Trp Met Ala Arg Thr Gly
130 135 140

Val Phe Leu Ala Ser Ala Phe Phe His Glu Tyr Leu Val Ser Val Pro
145 150 155 160

Leu Arg Met Phe Arg Leu Trp Ala Phe Thr Gly Met Met Ala Gln Ile
165 170 175

Pro Leu Ala Trp Phe Val Gly Arg Phe Phe Gln Gly Asn Tyr Gly Asn
180 185 190

Ala Ala Val Trp Leu Ser Leu Ile Ile Gly Gln Pro Ile Ala Val Leu
195 200 205

Met Tyr Val His Glu Leu Leu Arg Ala Gln Leu
210 215
210 215 

(2) INFORMATION FOR SEQ ID NO: 9:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 455 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: other nucleic acid



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:9: 

ATGTTGAACT TCATGATGCA TGACCAGCGC ACCGGCCCGG CATGGAACGT GCTGATGTGG 60

ACCATGCTGT TTCTAGGCCA GGGAATCCAG GTCAGCCTGT ACTGCCAGGA GTGGTACGCA 120 

CGGACGCACT GtCCCTTACC CCAGGCAACT TTCTGGGGGC TGGTGACACC TCGATCTTGG 180 

TCCTGCCATA CCTAGAGGTC GGGACAGACG ACGCTACCTG CCCAGACACC ACCAAGTTCT 240 

CTGCCTGCAA AACCTGGGGA CCAGGACTTC CTGTCTTGCA TTCCCAAATT TGGGTTCTTG 300 

AGTCGAGGCA ACCTTGCACA CAAGACCCCA CCAAGGGATT GTTGCAAGGG ATTAGATTTT 360 

GCAGATTTGT TGGGTAATGA TTCAACGACT CAGCTGGGGG TTGACCAGGG TTGATTTTTC 420

AATCCTTTTC CCCTGGGTTT GGGTTACAGG TTTTT 455 
(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 64 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Amino Acid 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

Met Leu Asn Phe Met Met His Asp Gln Arg Thr Gly Pro Ala Trp Asn
1 5 10 15

Val Leu Met Trp Thr Met Leu Phe Leu Gly Gln Gly Ile Gln Val Ser
20 25 30

Leu Tyr Cys Gln Glu Trp Tyr Ala Arg Thr His Cys Pro Leu Pro Gln
35 40 45

Ala Thr Phe Trp Gly Leu Val Thr Pro Arg Ser Trp Ser Cys His Thr
50 55 60



(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 517 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: other nucleic acid

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11:

ATGGACAACG CGGGGTCTGA TACGACTCAC TATAGGGAAT TTGGCCCTCG AGCAGTAGAT 60 

TCGGCACGAT GGGCACGAGG ACTCCATCAT GTTCCTCAAG CTTTATTCCT ACCGGGATGT 120 

CAACCTGTGG TGCCGCCAGC GAAGGGTCAA GGCCAAAGCT GTCTCTACAG GGAAGAAGGT 180 

CAGTGGGGCT GCTGCGAGCA AGCTGTGAGC TATCCAGACA ACCTGACCTA CCGAGATCTC 240

GATTACTTCA TCTTTGCTCC TACTTTGTGT TATGAACTCA ACTTTCCTCG GTCCCCCCGA 300

ATACGAGAGC GCTTTCTGCT ACGACGAGTT CTTGAGATGC TCTTTTTTAC CCAGCTTCAA 360

GTGGGGCTGA TCCAACAGTG GATGGTCCCT ACTATCCAGA ACTCCATGGA AGCCCTTTCA 420

AGAGCTTCTG GCAGTTTTGG AGACCGCGAG TTCTACAGAG ATTGGTGGAA TGCTGAGTCT 480

GTCACCGACT TTTGGCAGAA CTGGAATATC CCCGTGG 517
(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 172 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Amino Acid 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 

Met Asp Asn Ala Gly Ser Asp Thr Thr His Tyr Arg Glu Phe Gly Pro
1 5 10 15

Arg Ala Val Asp Ser Ala Arg Trp Ala Arg Gly Leu His His Val Pro
20 25 30

Gln Ala Leu Phe Leu Pro Gly Cys Gln Pro Val Val Pro Pro Ala Lys
35 40 45

Gly Gln Gly Gln Ser Cys Leu Tyr Arg Glu Glu Gly Gln Trp Gly Cys
50 55 60

Cys Glu Gln Ala Val Ser Tyr Pro Asp Asn Leu Thr Tyr Arg Asp Leu
65 70 75 80

Asp Tyr Phe Ile Phe Ala Pro Thr Leu Cys Tyr Glu Leu Asn Phe Pro
85 90 95

Arg Ser Pro Arg Ile Arg Glu Arg Phe Leu Leu Arg Arg Val Leu Glu
100 105 110

Met Leu Phe Phe Thr Gln Leu Gln Val Gly Leu Ile Gln Gln Trp Met
115 120 125

Val Pro Thr Ile Gln Asn Ser Met Glu Ala Leu Ser Arg Ala Ser Gly
130 135 140

Ser Phe Gly Asp Arg Glu Phe Tyr Arg Asp Trp Trp Asn Ala Glu Ser
145 150 155 160

Val Thr Asp Phe Trp Gln Asn Trp Asn Ile Pro Val
165 170

(2) INFORMATION FOR SEQ ID NO: 13:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 366 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Amino Acid 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 

Met Lys Asp Leu Leu Glu Phe Leu Lys Ile Arg Leu Asn Ala Asp Ala
1 5 10 15

Lys Arg Ser Thr Asp Ser Pro Thr Val Ser Glu Val Glu Arg Gly Lys
20 25 30

Gln Glu Ile Glu Ala His Lys Ser Lys Lys Arg Phe Arg Ser Phe Ser
35 40 45

Phe Phe Asp Ser Phe Glu Arg Pro Ser Leu Leu Asp Gly Asn Pro Phe
50 55 60

Thr Thr Phe Gly Pro Val Leu Glu Lys Glu Lys Asn Leu His Lys Lys
65 70 75 80

Lys Thr Thr Val Thr Asp Val Ser Asn Phe Ser Gly Ile Tyr Val Phe
85 90 95

Trp Met Leu Ala Leu Asp Tyr Tyr Gly Glu Ile Leu Tyr Met Thr Thr
100 105 110

Leu Phe Thr Val Ala Asp Leu Met Phe Leu Ser Thr Phe Phe Val Val
115 120 125

Leu Lys Trp Thr Gly Ile Ser Ile Glu Phe Leu Phe Ile Phe Leu Trp
130 135 140

Ser Arg Ile Phe Leu Phe Leu His Ser Val Phe Val Met Lys His Ser
145 150 155 160

Phe Ala Phe Tyr Asn Gly Tyr Leu Trp Ile Lys Glu Glu Leu Ser Leu
165 170 175

Lys Tyr Lys Glu Ser Ser Pro Leu Gln Lys Ser Cys Phe Cys Phe Glu
180 185 190

Leu Gln Phe Pro Asn Asn Ile Ser Phe Phe Phe Phe Pro Thr Leu Ile
195 200 205

Tyr Gln Ile Tyr Pro Arg Thr Ile Arg Trp Tyr Val Leu Glu Lys Cys
210 215 220

Ala Ile Phe Gly Thr Ile Phe Leu Met Met Ala Gln Met Pro Val Ala
225 230 235 240

Met Arg Asn Phe Trp Gln Leu Leu Asp Ile Pro Gly Phe Val Leu Tyr
245 250 255

Leu Thr Phe Tyr Ile Trp Asp Ala Leu Asn Cys Val Ala Glu Leu Thr
260 265 270

Arg Phe Gly Asp Arg Tyr Phe Tyr Gly Asp Trp Trp Asn Cys Val Ser
275 280 285

Phe Ser Arg Ile Trp Asn Val Pro Val His Lys Phe Leu Leu Arg His
290 295 300

Val Tyr His Ser Ser Met Phe Lys Leu Lys Ser Gln Ala Thr Leu Thr
305 310 315 320

Phe Phe Leu Ser Ala Val Val His Glu Ala Met Val Ile Phe Arg Tyr
325 330 335

Leu Phe Phe Gln Gln Met Ala Leu Asn Thr Lys Phe Arg Arg Ile Asn
340 345 350

Val Phe Trp Gly Cys Gly Pro Ser Val Thr Leu Tyr Leu Thr
355 360 365

(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 96 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Amino Acid 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14:

Pro Asn His Leu Ile Trp Leu Ile Phe Phe Tyr Trp Leu Phe His Ser
1 5 10 15

Cys Leu Asn Ala Val Ala Glu Leu Met Gln Phe Gly Asp Arg Glu Phe
20 25 30

Tyr Arg Asp Trp Trp Asn Ser Glu Ser Val Thr Tyr Phe Trp Gln Asn
35 40 45

Trp Lys Ile Pro Val His Lys Trp Cys Ile Arg His Phe Tyr Lys Pro
50 55 60

Met Leu Arg Arg Gly Ser Ser Lys Trp Met Ala Arg Asp Arg Gly Val
65 70 75 80

Pro Gly Pro Ser Ala Phe Phe His Val Val Thr Trp Val Ser Val Pro
85 90 95



(2) INFORMATION FOR SEQ ID NO: 15:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 91 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 
GAGGGGACGA AAATTAGCCG CTATTAATTC TGGTATTGCC ACCTAGACAA GAAGTAAACA 60
GACACAGATG CAAGAGTTCG AATCTCTTAG C 91
(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 76 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 
CTATAAAGAT TTAATAGCTC CACAGAACAG TTGCAGGATG CCTTAGGGTC GACTACGTCG 60 
TAAGGCCGTT TCTGAC 76 
(2) INFORMATION FOR SEQ ID NO: 17:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 
CATTGCAGTT ACACGTGAAT GC 22
(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: other nucleic acid

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18:
TAGCTCCACA GAACAGTTGC AGG 
(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19:
CTCTGACAAC AACGAAGTCA G

What is claimed is:

1. An isolated nucleic acid which encodes an
acylcoenzyme A: cholesterol acyltransferase II.

2. An isolated nucleic acid which encodes an
acylcoenzyme A: cholesterol acyltransferase III.

3. The isolated nucleic acid of claim 1 or 2, wherein
the nucleic acid is DNA or RNA.

4. The isolated nucleic acid of claim 3, wherein the
nucleic acid is cDNA or genomic DNA.

5. The isolated nucleic acid of claim 1 comprising a
nucleic acid having the sequence as set forth in
Figure 15.

6. The isolated nucleic acid of claim 1, wherein the
nucleic acid encodes a human wildtype acylcoenzyme
A: cholesterol acyltransferase II having
substantially the same amino acid sequence as set
forth in Figure 15.

7. The isolated nucleic acid of claim 2, comprising a
nucleic acid having the sequence as set forth in
Figure 16.

8. The isolated nucleic acid of claim 2, wherein the
nucleic acid encodes a human wildtype acylcoenzyme
A: cholesterol acyltransferase III having
substantially the same amino acid sequence as set
forth in Figure 16.



9. The isolated nucleic acid of claim 1 comprising a
nucleic acid having the sequence designated Seq.
I.D. No.: 11.

10. The isolated nucleic acid of claim 1, wherein the
nucleic acid encodes a mouse wildtype acylcoenzyme
A: cholesterol acyltransferase II having
substantially the same amino acid sequence as the
sequence designated Seq. I.D. No.: 12.

11. The isolated nucleic acid of claim 1, wherein the
nucleic acid encodes a mutant acylcoenzyme A:
cholesterol acyltransferase II.

12. The isolated nucleic acid of claim 2, wherein the
nucleic acid encodes a mutant acylcoenzyme A:
cholesterol acyltransferase III.

13. A vector comprising the isolated nucleic acid of
claim 1 or 2.

14. The vector of claim 13 further comprising a promoter
of RNA transcription operatively linked to the
nucleic acid.

15. The vector of claim 14, wherein the promoter
comprises a bacterial, yeast, insect or mammalian
promoter.

16. The vector of claim 14, further comprising plasmid,
cosmid, yeast artificial chromosome (YAC),
bacteriophage or eukaryotic viral DNA.

17. The vector of claim 14 designated YEpAB-ACAT2.

18. A host vector system for the production of a
polypeptide which comprises the vector of claim 14
in a suitable host.

19. The host vector system of claim 18, wherein the 
suitable host is a prokaryotic or eukaryotic cell. 

20. The host vector system of claim 19, wherein the 
prokaryotic cell is a bacterial cell. 

21. The host vector system of claim 19, wherein the 
eukaryotic cell is a yeast, insect, plant or 
mammalian cell. 

22. A method for producing a polypeptide which comprises
growing the host vector system of claim 18 under
suitable conditions permitting production of the
polypeptide and recovering the polypeptide so
produced.

23. A method of obtaining a polypeptide in purified form
which comprises:

(a) introducing the vector of claim 14 into a
suitable host cell;

(b) culturing the resulting cell so as to produce
the polypeptide;

(c) recovering the polypeptide produced in step
(b); and

(d) purifying the polypeptide so recovered.

24. A purified wildtype acylcoenzyme A: cholesterol
acyltransferase II.

25. A purified mutant acylcoenzyme A: cholesterol
acyltransferase II.

26. A purified wildtype acylcoenzyme A: cholesterol
acyltransferase III.

27. A purified mutant acylcoenzyme A: cholesterol
acyltransferase III.

28. An oligonucleotide of at least 15 nucleotides
capable of specifically hybridizing with a unique
sequence of nucleotides present within a nucleic
acid which encodes a wildtype acylcoenzyme A:
cholesterol acyltransferase II without hybridizing
to a nucleic acid which encodes a mutant
acylcoenzyme A: cholesterol acyltransferase II.

29. An oligonucleotide of at least 15 nucleotides
capable of specifically hybridizing with a unique
sequence of nucleotides present within the nucleic
acid which encodes a mutant acylcoenzyme A:
cholesterol acyltransferase II without hybridizing
to a nucleic acid which encodes a wildtype
acylcoenzyme A: cholesterol acyltransferase II.

30. An oligonucleotide of at least 15 nucleotides
capable of specifically hybridizing with a unique
sequence of nucleotides present within a nucleic
acid which encodes a wildtype acylcoenzyme A:
cholesterol acyltransferase III without hybridizing
to a nucleic acid which encodes a mutant
acylcoenzyme A: cholesterol acyltransferase III.

31. An oligonucleotide of at least 15 nucleotides
capable of specifically hybridizing with a unique
sequence of nucleotides present within the nucleic
acid which encodes a mutant acylcoenzyme A:
cholesterol acyltransferase III without hybridizing
to a nucleic acid which encodes a wildtype
acylcoenzyme A: cholesterol acyltransferase III.

32. The oligonucleotide of claim 28, 29, 30 or 31
wherein the nucleic acid is DNA or RNA.

33. A nucleic acid having a sequence complementary to
the sequence of the isolated nucleic acid of claim
1 or 2.



34. A method for determining whether a subject known to
have an imbalance in sterol levels has the imbalance
due to a defect in esterification of sterol which
comprises:

(a) obtaining from the subject an appropriate
sample containing a mixture of all of the
subject's nucleic acids; and

(b) determining whether any nucleic acid in the
sample from step (a) is, or is derived from, a
nucleic acid which encodes a mutant
acylcoenzyme A: cholesterol acyltransferase so
as to thereby determine whether the subject's
imbalance in sterol levels is due to a defect
in esterification of sterol.

35. The method of claim 34, wherein the determining of
step (b) comprises:

(i) contacting the sample of step (a) with the
isolated nucleic acid of claim 11 or 12 or
the oligonucleotide of claim 29 or 31
under conditions permitting binding of any
nucleic acid in the sample which is, or is
derived from, a nucleic acid which encodes
a mutant acylcoenzyme A: cholesterol
acyltransferase to the nucleic acid or
oligonucleotide so as to form a complex;

(ii) isolating the complex so formed; and

(iii) identifying the nucleic acid in the
isolated complex so as to thereby
determine whether any nucleic acid in the
sample contains a nucleic acid which is,
or is derived from, a nucleic acid which
encodes a mutant acylcoenzyme A:
cholesterol acyltransferase II or III.

36. The method of claim 35, wherein the isolated nucleic
acid or the oligonucleotide is labeled with a
detectable marker.

37. The method of claim 36, wherein the detectable
marker is a radioactive isotope, a fluorophore or an
enzyme.

38. The method of claim 35, wherein the nucleic acid
sample is first bound to a solid matrix before
performing step (i).

39. The method of claim 35, wherein the sample comprises
blood or sera.

40. A method for treating a subject who has an imbalance
in sterol levels due to a defect in esterification
of sterol which comprises introducing the isolated
nucleic acid of claim 1 or 2 into the subject under
conditions such that the nucleic acid expresses a
wildtype acylcoenzyme A: cholesterol acyltransferase
II or III, so as to thereby treat the subject.



41. A method for inhibiting wildtype acylcoenzyme A:
cholesterol acyltransferase II or III in a subject
which comprises transforming appropriate cells from
the subject with a vector which expresses the
nucleic acid of claim 33, and introducing the
transformed cells into the subject so as to thereby
inhibit wildtype acylcoenzyme A: cholesterol
acyltransferase II or III.

42. The method of claim 41, wherein the nucleic acid of
claim 33 is capable of specifically hybridizing to
a mRNA molecule encoding acylcoenzyme A: cholesterol
acyltransferase II or III so as to prevent
translation of the mRNA molecule.

43. A method for inhibiting the wildtype acylcoenzyme A:
cholesterol acyltransferase II or III in a subject
which comprises introducing the oligonucleotide of
claim 28 or 30 into the subject so as to thereby
inhibit the wildtype acylcoenzyme A: cholesterol
acyltransferase II or III.

44. The method of claim 43, wherein the oligonucleotide
of claim 28 or 30 is capable of specifically
hybridizing to a mRNA molecule encoding acylcoenzyme
A: cholesterol acyltransferase II or III so as to
prevent translation of the mRNA molecule.

45. A method for identifying a chemical compound which
is capable of inhibiting acylcoenzyme A: cholesterol
acyltransferase II or III in a subject which
comprises:

(a) contacting a wildtype acylcoenzyme A:
cholesterol acyltransferase II or III with the
chemical compound under conditions permitting
binding between the acylcoenzyme and the
chemical compound;

(b) detecting specific binding of the chemical
compound to the acylcoenzyme; and

(c) determining whether the chemical compound
inhibits the activity of the coenzyme so as to
identify a chemical compound which is capable
of inhibiting acylcoenzyme A: cholesterol
acyltransferase II or III in a subject.

46. A pharmaceutical composition comprising the chemical
compound identified by the method of claim 45 in an
amount effective to inhibit acylcoenzyme A:
cholesterol acyltransferase II or III in a subject
and a pharmaceutically effective carrier.

47. A method of treating a subject who has
atherosclerosis comprising administering the
pharmaceutical composition of claim 46 to the
subject.

48. A method of treating a subject who has
hyperlipidemia comprising administering the
pharmaceutical composition of claim 46 to the
subject.



49. A transgenic, nonhuman mammal comprising
isolated nucleic acid of claim 1 or 2.

FIG. 2B (drawing; recoverable labels: 1.9Kb, 2.9Kb, Wild-type)



FIG. 2D (drawing; recoverable labels: KO-5', LEU2, KO-3', ARE2, are2-5', L2-3', are2-3')



FIG. 2E



FIG. 4C Sterol Ester 




I 1 1 1 1 r 

0 1000 2000 3000 4000 5000 



H-oleate incorporation 
DPM/mg dry weight 



FIG. 4D 




0.000 0.050 0.100 0.150 

14 C-acetate incorporation 
(ratio of incorporation, sterol/fatty acid) 



11/33




12/33



13/33



14/33



15/33



16/33



17/33



18/33



19/33



20/33






21/33 
FIG. 6A 




Not I 4661 

FIG. 6B 
ori 




22/33

FIG. 6C

Labels: 81.0, 50.7, 41.5; hACAT, DM10



23/33



24/33



25/33

FIG. 9



26/33

FIG. 10

ARGP1

RT-PCR product:






27/33 



FIG. 11 



ARGP1 1 MSNYRG I LNWCWMLI 15
hACAT 101 EGEKNNHRAKDLRAPPEQGK I F I ARRS LLDELLEVDH I RT I YHMFI ALL I 150

ARGP1 16 LSNARLFLENLIKYGILVDPIQWSLFLKDPHSWPAPCLVIAANVFAVAA 65
hACAT 151 LF I LSTLWDYI DEGRLVLEFSLLSYAFGKFPTWWTWWIMFLSTFSVPY 200

ARGP1 66 FQVEKRLAVGALTEQA GLLLHVANLAT I LCFPAAWLLVES IT 108
hACAT 201 FLFQHWRTGYSKSSHPLIRSLFHGFLFMIFQIG . VLGFGPTYWLAYTLP 249

ARGP1 109 PVGSLLALMAHTILFLKPFSYRDVNSWCRRARAKAASAGKKASSAAAPHT 158
hACAT 250 PASRFIIIFEQIRFVMKA HSFVRENVPRVLNS AKEKSSTVPI PT 293

ARGP1 159 VSYPD^TYRDLYYFLFAPTLCYELNFPRSPRIRKRFLLRRILEMLFFTQ 208
hACAT 294 VN QYLYFLFAPTLIYRDSYPRNPTVRWGYVAMK . FAQVFGCF 334

ARGP1 209 LQVGLIQQWMVPTIQNSMKPFKDMDYSRI IERLLKLAV PNHLIWL 253
hACAT 335 FYVYYIFERLCAPL FRNIKQEPFSARVLVLCVFNSILPGVLILF 378

ARGP1 254 IFFYWLFHSCLNAVAELMQFGDREFYRDWWNSESVTYFWQNWNIPVHKWC 303
hACAT 379 LTFFAFLHCWLNAFAEMLRFGDRMFTKDWVWSTSYSNYYRTWNWVHDWL 428

ARGP1 304 IRHFYKPMLRRGSSKWM. . ARTGVFLASAFFHEYLVSVPLR MFRL 346
hACAT 429 YYYAYKDFLWFFSKRFKSAAMLAVFAVSAWHEYALAVCLSFFYPVLFVL 478

ARGP1 347 WAFTGMMAQIPLAWFVGRFFQGNYGNAAVWLSLI IGQPI AVLMYVHDYYV 396
hACAT 479 FMFFGM AFNFIVNDSRKKPIWNVLMWTSLFLGNGVLLCFYSQEWYA 524

ARGP1 397 LNYEAPAAEA 406
hACAT 525 RRHCPLKNPTFLDYVRPRSWTCRYVF 550



28/33



29/33



31/33



32/33



33/33



INTERNATIONAL SEARCH REPORT 



International application No. 
PCT/US97/09460 



A. CLASSIFICATION OF SUBJECT MATTER 
IPC(6) :C07H 21/04; C12N 15/11; C12P 21/02 
US CL : 536/23.2, 23.5; 435/70.1, 320.1, 325

According to International Patent Classification (IPC) or to both national classification and IPC 

B. FIELDS SEARCHED 

Minimum documentation searched (classification system followed by classification symbols) 

U.S. : 536/23.2, 23.5; 435/70.1, 320.1, 325



Documentation searched other than minimum documentation to the extent that such documents are included in the fields searched



Electronic data base consulted during the international search (name of data base and, where practicable, search terms used) 

APS, MEDLINE, BIOSIS, CA, DERWENT

search terms: acylcoenzyme A: cholesterol acyltransferase



C. DOCUMENTS CONSIDERED TO BE RELEVANT



Category*



Citation of document, with indication, where appropriate, of the relevant passages 



Relevant to claim No. 



Expressed Sequence Tag (EST) database Accession Numbers 
R99213 (13 September 1995), R99214 (13 September 
1995), H27149 (12 July 1995), N76754 (02 April 1996), 
H45923 (31 July 1995), H45924 (31 July 1995), H24791 
(07 July 1995), Z39933 (21 September 1995), T79408 (15 
March 1995), R48474 (18 May 1995), T79494 (15 March 
1995), R48475 (18 May 1995), M79086 (26 May 1992), 
and W10786 (26 April 1996). 

US 5,484,727 A (CHANG et al.) 16 January 1996, cols. 1-5. 



1-12 and 33 
13-32 



13-32 



[ ] Further documents are listed in the continuation of Box C.    [ ] See patent family annex.



* Special categories of cited documents:

"A" document defining the general state of the art which is not considered to be of particular relevance

"E" earlier document published on or after the international filing date

"L" document which may throw doubts on priority claim(s) or which is cited to establish the publication date of another citation or other special reason (as specified)

"O" document referring to an oral disclosure, use, exhibition or other means

"P" document published prior to the international filing date but later than the priority date claimed

"T" later document published after the international filing date or priority date and not in conflict with the application but cited to understand the principle or theory underlying the invention

"X" document of particular relevance; the claimed invention cannot be considered novel or cannot be considered to involve an inventive step when the document is taken alone

"Y" document of particular relevance; the claimed invention cannot be considered to involve an inventive step when the document is combined with one or more other such documents, such combination being obvious to a person skilled in the art

"&" document member of the same patent family



Date of the actual completion of the international search
19 AUGUST 1997 


Date of mailing of the international search report


Name and mailing address of the ISA/US 
Commissioner of Patents and Trademarks
Box PCT

Washington, D.C. 20231
Facsimile No. (703) 305-3230 


Authorized officer
Telephone No. 308-0195



Form PCT/ISA/210 (second sheet) (July 1992)*






Box I Observations where certain claims were found unsearchable (Continuation of item 1 of first sheet) 

This international report has not been established in respect of certain claims under Article 17(2)(a) for the following reasons:

1. [ ] Claims Nos.:

because they relate to subject matter not required to be searched by this Authority, namely: 



2. [ ] Claims Nos.:

because they relate to parts of the international application that do not comply with the prescribed requirements to such
an extent that no meaningful international search can be carried out, specifically: 



3. [ ] Claims Nos.:

because they are dependent claims and are not drafted in accordance with the second and third sentences of Rule 6.4(a).



Box II Observations where unity of invention is lacking (Continuation of item 2 of first sheet)



This International Searching Authority found multiple inventions in this international application, as follows: 
Please See Extra Sheet 



1. [ ] As all required additional search fees were timely paid by the applicant, this international search report covers all searchable claims.



2. [ ] As all searchable claims could be searched without effort justifying an additional fee, this Authority did not invite payment
of any additional fee. 

3. [ ] As only some of the required additional search fees were timely paid by the applicant, this international search report covers
only those claims for which fees were paid, specifically claims Nos.: 



4. [X] No required additional search fees were timely paid by the applicant. Consequently, this international search report is
restricted to the invention first mentioned in the claims; it is covered by claims Nos.: 
1-33 



Remark on Protest [ ] The additional search fees were accompanied by the applicant's protest.

[ ] No protest accompanied the payment of additional search fees.



Form PCT/ISA/210 (continuation of first sheet(1)) (July 1992)*






BOX II. OBSERVATIONS WHERE UNITY OF INVENTION WAS LACKING 
This ISA found multiple inventions as follows: 

This application contains the following inventions or groups of inventions which are not so linked as to form a single
inventive concept under PCT Rule 13.1. In order for all inventions to be searched, the appropriate additional search
fees must be paid.

Group I, claim(s) 1-33, drawn to acylcoenzyme A: cholesterol acyltransferase II and III nucleic acids and enzymes, and
a first method of use comprising recombinant expression.

Group II, claim(s) 34-39, drawn to a second method of use comprising detecting a defect in esterification of sterol in a
subject.

Group III, claim(s) 40, drawn to a third method of use comprising treating a subject having an unbalance in sterol levels.

Group IV, claims 41-44, drawn to a fourth method of use comprising inhibiting wildtype acylcoenzyme A: cholesterol
acyltransferase II or III in a subject.

Group V, claim 45, drawn to a fifth method of use comprising identifying a chemical compound which is capable of
inhibiting acylcoenzyme A: cholesterol acyltransferase II or III in a subject.

Group VI, claims 46-48, drawn to an undefined pharmaceutical compound and methods of its use in treating a subject.

Group VII, claim 49, drawn to a transgenic, nonhuman mammal.

The inventions listed as Groups I-VII do not relate to a single inventive concept under PCT Rule 13.1 because, under
PCT Rule 13.2, they lack the same or corresponding special technical features for the following reasons: PCT Rule
13.2 only provides for grouping together of a first product and a first method of use, rather than for multiple uses.
Further, the compound and uses of Group VI and the transgenic mammal of Group VII lack the same or corresponding
special technical feature as the other groups, which relate specifically to the acylcoenzyme A: cholesterol
acyltransferase II or III sequence.



Form PCT/ISA/210 (extra sheet) (July 1992)* 



